OVHcloud AI Endpoints

Name: OVHcloud AI Endpoints
Brand: OVHcloud

Managed serverless inference endpoints for open-source large language models (Llama, Mistral, DeepSeek), with OpenAI-compatible API and per-million-tokens pricing — positioned as the EU-resident alternative to hosted OpenAI / Anthropic APIs

FluffyStack tools

Add to Service Builder Add to Compare Compare with equivalents Explore OVH in Treemap Explore ai-ml in Honeycomb See OVH regions on the World Map See ai-ml as a network Score jurisdiction exposure

Documentation Pricing OVH website

Jurisdictional exposure

Provider HQ

EURoubaix, France

Subject to GDPR, Data Act, DGA

Region locations

APACEUUKUSOther14 regions across 5 jurisdictions

Sovereign option

Yes — 8 sovereign-flagged regions available

Full scorecard for this service →EU lens detail →Sovereign cloud coverage map →

Sub-services (2)

LLM Endpoints

Hosted inference endpoints for Llama / Mistral / DeepSeek variants

Embedding Models

Vector-embedding endpoints for RAG and similarity-search workloads

Compliance & Certifications

This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.

GDPR ISO 27001 G-Cloud

Where this runs

14 regions

9 countries

8sovereign

Sovereign regions (8)

GravelinesOVHcloud Sovereign Cloud (EU)
RoubaixOVHcloud Sovereign Cloud (EU)
StrasbourgOVHcloud Sovereign Cloud (EU)
ParisOVHcloud Sovereign Cloud (EU)
FrankfurtOVHcloud Sovereign Cloud (EU)
LondonOVHcloud Sovereign Cloud (UK)
WarsawOVHcloud Sovereign Cloud (EU)
Erith · LondonOVHcloud Sovereign Cloud (UK)

Commercial regions (6)

North America (3)

Beauharnois
Hillsboro
Vint Hill

Asia (2)

Mumbai
Singapore

Oceania (1)

Sydney

Equivalent services on other platforms

Aruba AI StackAruba

Dedicated AI infrastructure stack combining GPU-on-Demand compute, Object Storage, and managed model hosting for end-to-end AI workloads on Italian-sovereign infrastructure

Amazon BedrockAWS

Build generative AI applications with foundation models from Anthropic (Claude Opus 4.7 from April 2026), Cohere, Meta, Mistral, Stability AI, TwelveLabs (video understanding), and Amazon's own Nova family — accessed via a single API with fine-tuning, knowledge bases, agents, and a model marketplace for discovery and easy onboarding

Azure OpenAI ServiceAzure

Enterprise access to OpenAI models including GPT-4, GPT-3.5, and DALL-E with Azure security, private networking, regional deployments, and pay-as-you-go or provisioned throughput

Exoscale AI Cloud InfrastructureExoscale

Dedicated AI/ML infrastructure combining GPU compute, Object Storage for datasets, and managed model hosting on Swiss-resident infrastructure — positioned for regulated EU customers needing AI workloads outside US jurisdiction

Vertex AIGCP

Unified platform to build, deploy, and scale ML models with AutoML, custom training on TPUs and GPUs, model registry, pipelines, feature store, and generative AI studio

Infomaniak AI ToolsInfomaniak

Sovereign Swiss AI suite: managed inference endpoints for open-source LLMs (Llama, Mistral), AI Studio chat interface (Kchat), and document-analysis APIs — positioned as the Swiss-resident alternative to hosted OpenAI / Anthropic APIs

Nscale Serverless InferenceNscale

Managed serverless inference endpoints for open-source large language models hosted on Nscale's GPU infrastructure, with OpenAI-compatible API and per-million-tokens pricing

OCI Generative AIOracle

Managed inference service hosting Cohere Command and Embed plus Meta Llama large language models — pay-per-token chat / completion / embedding APIs, plus fine-tuning on customer datasets via dedicated AI clusters

Outscale AI StudioOutscale

Managed AI platform hosting Mistral AI models (including Le Chat Enterprise) on Outscale's SecNumCloud-eligible French infrastructure — positioned as the strictest-sovereignty alternative to hosted OpenAI / Anthropic / Bedrock APIs for European public-sector and regulated workloads

Scaleway Generative APIsScaleway

Managed serverless inference endpoints for hosted open-source LLMs (Llama, Mistral, DeepSeek) with OpenAI-compatible API and per-million-tokens pricing — distinct from Scaleway's full AI Platform (training + custom-model hosting)

STACKIT AI Model ServingSTACKIT

Managed serverless inference endpoints for hosted open-source LLMs (Llama, Mistral) with OpenAI-compatible API and per-million-tokens pricing — positioned as the German-sovereign alternative to hosted OpenAI / Anthropic APIs

Pricing

Pricing model:per-million-tokens