Cloudflare AI Gateway
CloudflareAI & MLFree tier availableUnified proxy layer for LLM API traffic with caching of identical prompts, per-provider rate limits, retry and fallback policies, cost and latency analytics, and prompt logging for audit — works across OpenAI, Anthropic, Google, and Workers AI
Attributes
- Multi Provider
- Yes
Sub-services (4)
Response Caching
Deduplicate identical prompts across sessions to cut inference spend
Rate Limits
Per-gateway, per-user, or per-provider throttles to prevent runaway costs
Fallbacks and Retries
Automatic failover to a backup model or provider on error or timeout
Logging and Analytics
Full-prompt audit logs plus cost, latency, and error-rate dashboards
Compliance & Certifications
This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.
GDPRSOC 2ISO 27001HIPAAPCI DSS
Where this runs
29 regions
26 countries
Commercial regions (29)
Europe (10)
- Paris
- Frankfurt
- Dublin
- Milan
- Amsterdam
- Warsaw
- Madrid
- Stockholm
- Zurich
- London
North America (4)
- Toronto
- Ashburn
- Chicago
- San Jose
South America (2)
- Buenos Aires
- São Paulo
Asia (6)
- Hong Kong
- Mumbai
- Tokyo
- Singapore
- Seoul
- Taipei
Oceania (2)
- Sydney
- Auckland
Middle East (2)
- Tel Aviv
- Dubai
Africa (3)
- Lagos
- Cape Town
- Johannesburg
Tags
Pricing
Pricing model:freemium