Amazon Bedrock

AWSAI & ML

Build generative AI applications with foundation models from Anthropic (Claude Opus 4.7 from April 2026), Cohere, Meta, Mistral, Stability AI, TwelveLabs (video understanding), and Amazon's own Nova family — accessed via a single API with fine-tuning, knowledge bases, agents, and a model marketplace for discovery and easy onboarding

Attributes

Serverless
Yes
Fine Tuning
Yes
Guardrails
Yes
Video Understanding
Yes
Model Marketplace
Yes

Sub-services (6)

Model Access

Foundation models from Anthropic, Cohere, Meta, Mistral, Stability AI, Amazon, and TwelveLabs (video)

Knowledge Bases

Retrieval-augmented generation with enterprise data, with multimodal RAG and S3 Vectors backend

Bedrock Agents

Build autonomous AI agents that execute multi-step tasks

Bedrock Marketplace

Discover, evaluate, and deploy 100+ specialised models from third-party providers (re:Invent 2024)

Guardrails (Cross-Account)

Centralised content filtering with cross-account safeguards across an AWS Org (Apr 2026)

TwelveLabs Video Understanding

Search, classify, summarise, and extract insights from video corpora (NYC Summit 2025)

Compliance & Certifications

This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.

GDPRSOC 2ISO 27001HIPAAPCI DSSFedRAMP

Where this runs

40 regions
28 countries
6sovereign
Sovereign regions (6)
  • AWS European Sovereign Cloud (Brandenburg) · BrandenburgAWS European Sovereign Cloud
  • AWS GovCloud (US-East) · AshburnAWS GovCloud (US)
  • AWS GovCloud (US-West) · HillsboroAWS GovCloud (US)
  • AWS European Sovereign Cloud (Brandenburg) · BrandenburgAWS European Sovereign Cloud
  • China (Beijing) · BeijingAWS China (Sinnet)
  • China (Ningxia) · YinchuanAWS China (NWCD)
Commercial regions (34)

Europe (8)

  • Europe (Paris)
  • Europe (Frankfurt)
  • Europe (Ireland)
  • Europe (Milan)
  • Europe (Spain)
  • Europe (Stockholm)
  • Europe (Zurich)
  • Europe (London)

North America (7)

  • Canada West (Calgary)
  • Canada (Central)
  • Mexico (Central)
  • US East (N. Virginia)
  • US West (Oregon)
  • US East (Ohio)
  • US West (N. California)

South America (1)

  • South America (São Paulo)

Asia (11)

  • Asia Pacific (Hong Kong)
  • Asia Pacific (Hyderabad)
  • Asia Pacific (Mumbai)
  • Asia Pacific (Jakarta)
  • Asia Pacific (Osaka)
  • Asia Pacific (Tokyo)
  • Asia Pacific (Malaysia)
  • Asia Pacific (Singapore)
  • Asia Pacific (Seoul)
  • Asia Pacific (Taipei)
  • Asia Pacific (Thailand)

Oceania (3)

  • Asia Pacific (Melbourne)
  • Asia Pacific (Sydney)
  • Asia Pacific (New Zealand)

Middle East (3)

  • Middle East (Bahrain)
  • Israel (Tel Aviv)
  • Middle East (UAE)

Africa (1)

  • Africa (Cape Town)

Tags

Equivalent services on other platforms

Alibaba Qwen (Tongyi Qianwen)Alibaba

Alibaba's flagship open-source foundation model family covering Qwen (text), Qwen-VL (vision-language), Qwen-Audio, and Qwen-Coder — accessible via the DashScope API with chat, completion, embeddings, and function-calling endpoints

Amazon NovaAWS

AWS-built foundation model family covering text (Micro, Lite, Pro, Premier), image generation (Canvas), and video generation (Reel) — accessed through the Bedrock runtime with tight pricing and low-latency streaming, launched at re:Invent 2024

Azure OpenAI ServiceAzure

Enterprise access to OpenAI models including GPT-4, GPT-3.5, and DALL-E with Azure security, private networking, regional deployments, and pay-as-you-go or provisioned throughput

Azure Health BotAzure

HIPAA-compliant conversational AI platform for healthcare with a built-in clinical knowledge graph, triage scenarios, symptom checker, and compliance tooling for building patient-facing chat experiences grounded in medical ontologies

Cloudflare VectorizeCloudflare

Globally-distributed vector database for RAG, similarity search, and recommendations with native Workers AI integration, up to 5M vectors per index, metadata filtering, and cosine / Euclidean / dot-product similarity

Cloudflare Workers AICloudflare

Serverless GPU-backed AI inference at the edge running a catalogue of open-source text, image, speech, and embedding models (Llama, Mistral, Stable Diffusion, Whisper, BGE) with pay-per-neurone pricing and direct binding from Workers code

Mosaic AIDatabricks

End-to-end AI platform (formerly MLflow + Mosaic ML) for training, fine-tuning, deploying, and monitoring foundation models and custom ML models on the Lakehouse

Vector SearchDatabricks

Serverless vector database built into the Lakehouse for similarity search, RAG applications, and recommendation systems with automatic embedding sync from Delta tables

Vertex AIGCP

Unified platform to build, deploy, and scale ML models with AutoML, custom training on TPUs and GPUs, model registry, pipelines, feature store, and generative AI studio

Gemini APIGCP

Direct API access to Google's most capable multimodal AI models with text, image, audio, and video understanding, long context windows, and function calling support

DialogflowGCP

Conversational AI platform with Dialogflow CX for complex multi-turn state-machine agents and Dialogflow ES for simpler intent-based bots, integrated with Google's generative playbooks, Contact Center AI, and omnichannel messaging connectors

Watson AssistantIBM

Conversational AI for building chatbots and virtual agents with visual dialogue builder, intent and entity detection, voice integration via phone, and multi-channel deployment

IBM watsonx.aiIBM

Enterprise AI studio for training, validating, tuning, and deploying foundation models and traditional ML models, with IBM's Granite model family, Hugging Face integration, prompt lab, synthetic data generation, and governance via watsonx.governance

Kanana AIKakao

Kakao's Korean-first foundation-model family (Kanana Flash / Essence / Nano) for chat, code, and embeddings — multilingual but tuned for Korean conversational performance

CLOVA StudioNaver

Naver's HyperCLOVA X foundation-model platform for Korean-language LLM workloads — chat completion, embeddings, function calling, RAG over Korean text with strong native-language performance

OCI Generative AIOracle

Fully managed service offering Cohere and Llama large language models

Generative APIsScaleway

Managed inference for open-source LLMs (Llama, Mistral, DeepSeek) hosted in EU datacentres

Snowflake CortexSnowflake

Fully managed AI and ML service offering hosted LLMs, vector search, and ML functions inside Snowflake SQL

Tencent HunyuanTencent

Tencent's in-house family of large language models (Hunyuan-Pro, Standard, Lite, plus multimodal Hunyuan-Vision) accessible via the Hunyuan API, with enterprise-grade context windows up to 256K, function calling, embeddings, and tuning

Pricing

Pricing model:pay-per-token