Azure OpenAI Service

AzureAI & ML

Enterprise access to OpenAI models including GPT-4, GPT-3.5, and DALL-E with Azure security, private networking, regional deployments, and pay-as-you-go or provisioned throughput

Attributes

SLA Uptime
99.9%
Content Filtering
Yes
Private Endpoints
Yes

Sub-services (3)

Chat Completions

Conversational AI with GPT models

Embeddings

Generate vector representations of text

DALL-E

AI image generation from text prompts

Compliance & Certifications

This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.

GDPRSOC 2ISO 27001HIPAAPCI DSSFedRAMPG-Cloud

Where this runs

73 regions
36 countries
13sovereign
Sovereign regions (13)
  • Australia Central · CanberraAzure Australia Government
  • Australia Central 2 · CanberraAzure Australia Government
  • US Gov Virginia · VirginiaAzure Government (US)
  • US Gov Arizona · ArizonaAzure Government (US)
  • US Gov Texas · TexasAzure Government (US)
  • US DoD East · VirginiaAzure Government Secret (US)
  • US DoD Central · IowaAzure Government Secret (US)
  • China North (Beijing) · BeijingMicrosoft Azure China (21Vianet)
  • China East (Shanghai) · ShanghaiMicrosoft Azure China (21Vianet)
  • China North 2 · BeijingMicrosoft Azure China (21Vianet)
  • China East 2 · ShanghaiMicrosoft Azure China (21Vianet)
  • China North 3 · HebeiMicrosoft Azure China (21Vianet)
  • China East 3 · ShanghaiMicrosoft Azure China (21Vianet)
Commercial regions (60)

Europe (21)

  • Austria East
  • Belgium Central
  • Denmark East
  • Finland Central
  • France South
  • France Central
  • Germany North
  • Germany West Central
  • Greece Central
  • North Europe
  • Italy North
  • West Europe
  • Norway East
  • Norway West
  • Poland Central
  • Spain Central
  • Sweden Central
  • Switzerland West
  • Switzerland North
  • UK West
  • UK South

North America (13)

  • Canada East
  • Canada Central
  • Mexico Central
  • West US
  • East US 3
  • North Central US
  • Central US
  • West US 3
  • South Central US
  • East US
  • East US 2
  • West US 2
  • West Central US

South America (3)

  • Brazil Southeast
  • Brazil South
  • Chile Central

Asia (13)

  • East Asia
  • South India
  • Jio India West
  • West India
  • Jio India Central
  • Central India
  • Indonesia Central
  • Japan West
  • Japan East
  • Malaysia West
  • Southeast Asia
  • Korea South
  • Korea Central

Oceania (3)

  • Australia East
  • Australia Southeast
  • New Zealand North

Middle East (5)

  • Israel Central
  • Qatar Central
  • Saudi Arabia Central
  • UAE Central
  • UAE North

Africa (2)

  • South Africa West
  • South Africa North

Tags

Equivalent services on other platforms

Alibaba Qwen (Tongyi Qianwen)Alibaba

Alibaba's flagship open-source foundation model family covering Qwen (text), Qwen-VL (vision-language), Qwen-Audio, and Qwen-Coder — accessible via the DashScope API with chat, completion, embeddings, and function-calling endpoints

Amazon BedrockAWS

Build generative AI applications with foundation models from Anthropic (Claude Opus 4.7 from April 2026), Cohere, Meta, Mistral, Stability AI, TwelveLabs (video understanding), and Amazon's own Nova family — accessed via a single API with fine-tuning, knowledge bases, agents, and a model marketplace for discovery and easy onboarding

Amazon NovaAWS

AWS-built foundation model family covering text (Micro, Lite, Pro, Premier), image generation (Canvas), and video generation (Reel) — accessed through the Bedrock runtime with tight pricing and low-latency streaming, launched at re:Invent 2024

Amazon QAWS

Generative AI assistant family spanning software development (Q Developer, formerly CodeWhisperer), enterprise knowledge retrieval (Q Business), low-code app generation (Q Apps), and contact-centre augmentation (Q in Connect) with grounded answers against your own data

Amazon Bedrock AgentCoreAWS

Production runtime for AI agents — managed memory, identity, gateway, observability, and tool integration so teams can ship agentic workflows on top of any framework (Strands Agents, LangGraph, CrewAI, vendor-direct) without rebuilding the operational substrate

Amazon Nova ActAWS

Foundation-model service for browser-based agents — purpose-built model and SDK for AI agents that automate form fill, search, booking, QA testing, and other web-UI workflows, distinct from the general-purpose Nova family by being action-oriented rather than chat-oriented

Cloudflare Workers AICloudflare

Serverless GPU-backed AI inference at the edge running a catalogue of open-source text, image, speech, and embedding models (Llama, Mistral, Stable Diffusion, Whisper, BGE) with pay-per-neurone pricing and direct binding from Workers code

Mosaic AIDatabricks

End-to-end AI platform (formerly MLflow + Mosaic ML) for training, fine-tuning, deploying, and monitoring foundation models and custom ML models on the Lakehouse

Vertex AIGCP

Unified platform to build, deploy, and scale ML models with AutoML, custom training on TPUs and GPUs, model registry, pipelines, feature store, and generative AI studio

Gemini APIGCP

Direct API access to Google's most capable multimodal AI models with text, image, audio, and video understanding, long context windows, and function calling support

IBM watsonx.aiIBM

Enterprise AI studio for training, validating, tuning, and deploying foundation models and traditional ML models, with IBM's Granite model family, Hugging Face integration, prompt lab, synthetic data generation, and governance via watsonx.governance

Kanana AIKakao

Kakao's Korean-first foundation-model family (Kanana Flash / Essence / Nano) for chat, code, and embeddings — multilingual but tuned for Korean conversational performance

CLOVA StudioNaver

Naver's HyperCLOVA X foundation-model platform for Korean-language LLM workloads — chat completion, embeddings, function calling, RAG over Korean text with strong native-language performance

OCI Generative AIOracle

Fully managed service offering Cohere and Llama large language models

Generative APIsScaleway

Managed inference for open-source LLMs (Llama, Mistral, DeepSeek) hosted in EU datacentres

Snowflake CortexSnowflake

Fully managed AI and ML service offering hosted LLMs, vector search, and ML functions inside Snowflake SQL

Tencent HunyuanTencent

Tencent's in-house family of large language models (Hunyuan-Pro, Standard, Lite, plus multimodal Hunyuan-Vision) accessible via the Hunyuan API, with enterprise-grade context windows up to 256K, function calling, embeddings, and tuning

Pricing

Pricing model:pay-per-token