Cloudflare AI Gateway

CloudflareAI & MLFree tier available

Unified proxy layer for LLM API traffic with caching of identical prompts, per-provider rate limits, retry and fallback policies, cost and latency analytics, and prompt logging for audit — works across OpenAI, Anthropic, Google, and Workers AI

Attributes

Multi Provider
Yes

Sub-services (4)

Response Caching

Deduplicate identical prompts across sessions to cut inference spend

Rate Limits

Per-gateway, per-user, or per-provider throttles to prevent runaway costs

Fallbacks and Retries

Automatic failover to a backup model or provider on error or timeout

Logging and Analytics

Full-prompt audit logs plus cost, latency, and error-rate dashboards

Compliance & Certifications

This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.

GDPRSOC 2ISO 27001HIPAAPCI DSS

Where this runs

29 regions
26 countries
Commercial regions (29)

Europe (10)

  • Paris
  • Frankfurt
  • Dublin
  • Milan
  • Amsterdam
  • Warsaw
  • Madrid
  • Stockholm
  • Zurich
  • London

North America (4)

  • Toronto
  • Ashburn
  • Chicago
  • San Jose

South America (2)

  • Buenos Aires
  • São Paulo

Asia (6)

  • Hong Kong
  • Mumbai
  • Tokyo
  • Singapore
  • Seoul
  • Taipei

Oceania (2)

  • Sydney
  • Auckland

Middle East (2)

  • Tel Aviv
  • Dubai

Africa (3)

  • Lagos
  • Cape Town
  • Johannesburg

Tags

Pricing

Pricing model:freemium