Gcore Inference at the Edge
GcoreAI & MLAI inference runtime that deploys models to Gcore's edge POPs and routes requests to the nearest GPU-backed endpoint, with support for open-source LLMs and custom model containers
Jurisdictional exposure
Attributes
- GPU Support
- Yes
- GPU Vendor
- nvidia
Sub-services (3)
Open-source LLM endpoints
Pre-hosted endpoints for popular open-weight models with token-based billing
Custom model deployment
Bring-your-own container or weights, deployed to GPU-backed edge nodes
Latency-routed dispatch
Requests automatically routed to the nearest GPU-backed POP with capacity
Compliance & Certifications
This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.
Where this runs
Commercial regions (12)
Europe (6)
- Paris
- Frankfurt
- Luxembourg
- Amsterdam
- Madrid
- London
North America (2)
- US East (Virginia)
- US West (Santa Clara)
South America (1)
- São Paulo
Asia (2)
- Tokyo
- Singapore
Oceania (1)
- Sydney
Tags
Equivalent services on other platforms
Next-generation SageMaker (rebranded SageMaker AI) unifying data, analytics, and AI in one workspace — Studio notebooks, HyperPod for foundation-model training at scale, Lakehouse with QuickSight + S3 Tables integration, AutoPilot AutoML, managed training jobs, hosted inference endpoints, and Feature Store, with re:Invent 2024 introducing the unified SageMaker AI workspace and 2025 Summit additions extending it with lakehouse auto-onboarding
Pre-built AI APIs for vision, speech, language, and decision
Serverless GPU-backed AI inference at the edge running a catalogue of open-source text, image, speech, and embedding models (Llama, Mistral, Stable Diffusion, Whisper, BGE) with pay-per-neurone pricing and direct binding from Workers code
End-to-end AI platform (formerly MLflow + Mosaic ML) for training, fine-tuning, deploying, and monitoring foundation models and custom ML models on the Lakehouse
Unified platform to build, deploy, and scale ML models with AutoML, custom training on TPUs and GPUs, model registry, pipelines, feature store, and generative AI studio
Enterprise AI studio for training, validating, tuning, and deploying foundation models and traditional ML models, with IBM's Granite model family, Hugging Face integration, prompt lab, synthetic data generation, and governance via watsonx.governance
Fully managed service offering Cohere and Llama large language models