Vertex AI

GCPAI & MLFree tier available

Unified platform to build, deploy, and scale ML models with AutoML, custom training on TPUs and GPUs, model registry, pipelines, feature store, and generative AI studio

Attributes

GPU Support
Yes
Auto ML
Yes
Model Registry
Yes

Sub-services (4)

Custom Training

Distributed training for custom ML models

Online Prediction

Low-latency model serving endpoints

Vertex AI Pipelines

Serverless ML workflow orchestration

Feature Store

Centralized repository for ML features

Compliance & Certifications

This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.

GDPRSOC 2ISO 27001HIPAAPCI DSSFedRAMP

Where this runs

44 regions
28 countries
2sovereign
Sovereign regions (2)
  • T-Systems Sovereign Cloud · FrankfurtT-Systems Sovereign Cloud powered by Google Cloud
  • S3NS Sovereign Cloud · ParisS3NS — Google Cloud + Thales joint venture
Commercial regions (42)

Europe (13)

  • Belgium
  • Finland
  • Paris
  • Berlin
  • Frankfurt
  • Milan
  • Turin
  • Netherlands
  • Warsaw
  • Madrid
  • Stockholm
  • Zurich
  • London

North America (12)

  • Montréal
  • Toronto
  • Querétaro
  • Northern Virginia
  • Columbus
  • Iowa
  • Dallas
  • Las Vegas
  • Los Angeles
  • South Carolina
  • Salt Lake City
  • Oregon

South America (2)

  • São Paulo
  • Santiago

Asia (9)

  • Hong Kong
  • Delhi
  • Mumbai
  • Jakarta
  • Osaka
  • Tokyo
  • Singapore
  • Seoul
  • Taiwan

Oceania (2)

  • Melbourne
  • Sydney

Middle East (3)

  • Tel Aviv
  • Doha
  • Dammam

Africa (1)

  • Johannesburg

Tags

Equivalent services on other platforms

Alibaba Platform for AI (PAI)Alibaba

Enterprise ML and AI platform covering PAI-Studio visual workflow builder, PAI-DSW Jupyter notebooks, PAI-EAS elastic inference serving, PAI-Blade inference optimisation, and integration with Alibaba's Qwen foundation models

Amazon SageMakerAWS

Next-generation SageMaker (rebranded SageMaker AI) unifying data, analytics, and AI in one workspace — Studio notebooks, HyperPod for foundation-model training at scale, Lakehouse with QuickSight + S3 Tables integration, AutoPilot AutoML, managed training jobs, hosted inference endpoints, and Feature Store, with re:Invent 2024 introducing the unified SageMaker AI workspace and 2025 Summit additions extending it with lakehouse auto-onboarding

Amazon BedrockAWS

Build generative AI applications with foundation models from Anthropic (Claude Opus 4.7 from April 2026), Cohere, Meta, Mistral, Stability AI, TwelveLabs (video understanding), and Amazon's own Nova family — accessed via a single API with fine-tuning, knowledge bases, agents, and a model marketplace for discovery and easy onboarding

Amazon Bedrock AgentCoreAWS

Production runtime for AI agents — managed memory, identity, gateway, observability, and tool integration so teams can ship agentic workflows on top of any framework (Strands Agents, LangGraph, CrewAI, vendor-direct) without rebuilding the operational substrate

Amazon S3 VectorsAWS

Native vector storage in S3 — up to 2 billion vectors per index, sub-100 ms query latency, S3-native durability, and pricing claimed up to 90 percent lower than dedicated vector databases for retrieval-augmented generation and embedding-heavy workloads

Azure OpenAI ServiceAzure

Enterprise access to OpenAI models including GPT-4, GPT-3.5, and DALL-E with Azure security, private networking, regional deployments, and pay-as-you-go or provisioned throughput

Azure Machine LearningAzure

End-to-end platform for building and deploying ML models with automated ML, designer (drag-and-drop), managed compute clusters, MLflow tracking, and responsible AI dashboards

Cloudflare StreamCloudflare

Video streaming platform with global delivery and per-minute pricing

Cloudflare VectorizeCloudflare

Globally-distributed vector database for RAG, similarity search, and recommendations with native Workers AI integration, up to 5M vectors per index, metadata filtering, and cosine / Euclidean / dot-product similarity

Cloudflare Workers AICloudflare

Serverless GPU-backed AI inference at the edge running a catalogue of open-source text, image, speech, and embedding models (Llama, Mistral, Stable Diffusion, Whisper, BGE) with pay-per-neurone pricing and direct binding from Workers code

Mosaic AIDatabricks

End-to-end AI platform (formerly MLflow + Mosaic ML) for training, fine-tuning, deploying, and monitoring foundation models and custom ML models on the Lakehouse

Vector SearchDatabricks

Serverless vector database built into the Lakehouse for similarity search, RAG applications, and recommendation systems with automatic embedding sync from Delta tables

Managed MLflowDatabricks

Hosted MLflow with managed tracking server, model registry, and deployment integration — open-source ML lifecycle tooling tightly coupled to Databricks Unity Catalog and Mosaic AI

ModelArtsHuawei

End-to-end AI development platform with AutoML, data labelling, distributed training on Ascend and GPU clusters, and one-click deployment to cloud or edge

IBM watsonx.aiIBM

Enterprise AI studio for training, validating, tuning, and deploying foundation models and traditional ML models, with IBM's Granite model family, Hugging Face integration, prompt lab, synthetic data generation, and governance via watsonx.governance

Kanana AIKakao

Kakao's Korean-first foundation-model family (Kanana Flash / Essence / Nano) for chat, code, and embeddings — multilingual but tuned for Korean conversational performance

CLOVA StudioNaver

Naver's HyperCLOVA X foundation-model platform for Korean-language LLM workloads — chat completion, embeddings, function calling, RAG over Korean text with strong native-language performance

Red Hat OpenShift AIOpenShift

Managed MLOps platform (formerly Open Data Hub) for training, serving, and monitoring ML models on OpenShift with JupyterHub, KServe, Kubeflow, and PyTorch operators

OCI Generative AIOracle

Fully managed service offering Cohere and Llama large language models

OCI Data ScienceOracle

Fully managed machine learning platform with JupyterLab notebooks, conda-environment library, job orchestration, model deployments as HTTPS endpoints, feature store, and model catalog — integrated with Autonomous Database and Object Storage for end-to-end ML workflows

Einstein AISalesforce

AI layer integrated across Salesforce products: predictive lead scoring, opportunity scoring, generative AI for emails and summaries

Salesforce AgentforceSalesforce

Salesforce platform for building, deploying, and governing autonomous AI agents grounded in CRM data, with Atlas Reasoning Engine, Agent Studio, and Data Cloud retrieval

Generative APIsScaleway

Managed inference for open-source LLMs (Llama, Mistral, DeepSeek) hosted in EU datacentres

SnowparkSnowflake

Developer framework for building data applications in Python, Java, and Scala that run inside Snowflake

Pricing

Pricing model:pay-as-you-go