Gcore Inference at the Edge

GcoreAI & ML

AI inference runtime that deploys models to Gcore's edge POPs and routes requests to the nearest GPU-backed endpoint, with support for open-source LLMs and custom model containers

Jurisdictional exposure

Provider HQ
EULuxembourg, Luxembourg

Subject to GDPR, Data Act, DGA

Region locations
APACEUUKUSOther12 regions across 5 jurisdictions
Sovereign option
No sovereign-flagged regions in the catalogue for this service.

Attributes

GPU Support
Yes
GPU Vendor
nvidia

Sub-services (3)

Open-source LLM endpoints

Pre-hosted endpoints for popular open-weight models with token-based billing

Custom model deployment

Bring-your-own container or weights, deployed to GPU-backed edge nodes

Latency-routed dispatch

Requests automatically routed to the nearest GPU-backed POP with capacity

Compliance & Certifications

This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.

GDPRISO 27001

Where this runs

12 regions
11 countries
Commercial regions (12)

Europe (6)

  • Paris
  • Frankfurt
  • Luxembourg
  • Amsterdam
  • Madrid
  • London

North America (2)

  • US East (Virginia)
  • US West (Santa Clara)

South America (1)

  • São Paulo

Asia (2)

  • Tokyo
  • Singapore

Oceania (1)

  • Sydney

Tags

Equivalent services on other platforms

Amazon SageMakerAWS

Next-generation SageMaker (rebranded SageMaker AI) unifying data, analytics, and AI in one workspace — Studio notebooks, HyperPod for foundation-model training at scale, Lakehouse with QuickSight + S3 Tables integration, AutoPilot AutoML, managed training jobs, hosted inference endpoints, and Feature Store, with re:Invent 2024 introducing the unified SageMaker AI workspace and 2025 Summit additions extending it with lakehouse auto-onboarding

Azure AI ServicesAzure

Pre-built AI APIs for vision, speech, language, and decision

Cloudflare Workers AICloudflare

Serverless GPU-backed AI inference at the edge running a catalogue of open-source text, image, speech, and embedding models (Llama, Mistral, Stable Diffusion, Whisper, BGE) with pay-per-neurone pricing and direct binding from Workers code

Mosaic AIDatabricks

End-to-end AI platform (formerly MLflow + Mosaic ML) for training, fine-tuning, deploying, and monitoring foundation models and custom ML models on the Lakehouse

Vertex AIGCP

Unified platform to build, deploy, and scale ML models with AutoML, custom training on TPUs and GPUs, model registry, pipelines, feature store, and generative AI studio

IBM watsonx.aiIBM

Enterprise AI studio for training, validating, tuning, and deploying foundation models and traditional ML models, with IBM's Granite model family, Hugging Face integration, prompt lab, synthetic data generation, and governance via watsonx.governance

OCI Generative AIOracle

Fully managed service offering Cohere and Llama large language models

Pricing

Pricing model:per-inference