Managed retrieval-augmented-generation service — index your content (R2 buckets, websites, Workers KV) and query it with natural language from a Workers binding, REST API, or MCP server. Originally launched as AutoRAG and renamed AI Search in 2026.
Jurisdictional exposure
Attributes
- Ga Year
- 2026
Sub-services (3)
Indexing
Ingest content from R2 buckets, websites, or Workers KV into the vector index
Natural-Language Query
Retrieve relevant chunks from a Worker binding, REST API, or MCP server
Evaluation
Built-in eval metrics for retrieval and generation quality
Compliance & Certifications
This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.
Where this runs
Commercial regions (29)
Europe (10)
- Paris
- Frankfurt
- Dublin
- Milan
- Amsterdam
- Warsaw
- Madrid
- Stockholm
- Zurich
- London
North America (4)
- Toronto
- Ashburn
- Chicago
- San Jose
South America (2)
- Buenos Aires
- São Paulo
Asia (6)
- Hong Kong
- Mumbai
- Tokyo
- Singapore
- Seoul
- Taipei
Oceania (2)
- Sydney
- Auckland
Middle East (2)
- Tel Aviv
- Dubai
Africa (3)
- Lagos
- Cape Town
- Johannesburg
Tags
Equivalent services on other platforms
Build generative AI applications with foundation models from Anthropic (Claude Opus 4.7 from April 2026), Cohere, Meta, Mistral, Stability AI, TwelveLabs (video understanding), and Amazon's own Nova family — accessed via a single API with fine-tuning, knowledge bases, agents, and a model marketplace for discovery and easy onboarding
Enterprise search-as-a-service (formerly Azure Cognitive Search) with vector, hybrid, and semantic ranking, built-in AI skills for OCR and NLP enrichment, first-class integration with Azure OpenAI for RAG workloads, and 90+ data-source connectors including SharePoint, OneDrive, and Salesforce
Serverless vector database built into the Lakehouse for similarity search, RAG applications, and recommendation systems with automatic embedding sync from Delta tables
Unified platform to build, deploy, and scale ML models with AutoML, custom training on TPUs and GPUs, model registry, pipelines, feature store, and generative AI studio
End-to-end platform for building, deploying, and governing production AI workloads on OCI — unifies Gen AI models, agent orchestration, retrieval-augmented generation, and policy-based governance controls in a single managed service so enterprises don't have to assemble them from primitives.
Fully managed AI and ML service offering hosted LLMs, vector search, and ML functions inside Snowflake SQL