Dataproc
GCPAnalyticsManaged Spark and Hadoop clusters for big data processing with per-second billing, serverless Spark (Dataproc Serverless), Presto, Flink, and ephemeral or long-running clusters
Attributes
- Auto Scaling
- Yes
- Spark Support
- Yes
- Hadoop Support
- Yes
Sub-services (3)
Dataproc Clusters
Managed Spark and Hadoop cluster provisioning
Dataproc Serverless
Serverless Spark for batch workloads
Dataproc Metastore
Managed Hive Metastore for metadata management
Compliance & Certifications
This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.
Where this runs
Sovereign regions (2)
- T-Systems Sovereign Cloud · FrankfurtT-Systems Sovereign Cloud powered by Google Cloud
- S3NS Sovereign Cloud · ParisS3NS — Google Cloud + Thales joint venture
Commercial regions (42)
Europe (13)
- Belgium
- Finland
- Paris
- Berlin
- Frankfurt
- Milan
- Turin
- Netherlands
- Warsaw
- Madrid
- Stockholm
- Zurich
- London
North America (12)
- Montréal
- Toronto
- Querétaro
- Northern Virginia
- Columbus
- Iowa
- Dallas
- Las Vegas
- Los Angeles
- South Carolina
- Salt Lake City
- Oregon
South America (2)
- São Paulo
- Santiago
Asia (9)
- Hong Kong
- Delhi
- Mumbai
- Jakarta
- Osaka
- Tokyo
- Singapore
- Seoul
- Taiwan
Oceania (2)
- Melbourne
- Sydney
Middle East (3)
- Tel Aviv
- Doha
- Dammam
Africa (1)
- Johannesburg
Tags
Equivalent services on other platforms
Managed big-data platform for running Apache Spark, Hive, Presto, Flink, Trino, and HBase across EC2, EKS, and fully serverless deployments with up to 5x faster Spark runtime and Graviton price-performance
Managed open-source analytics clusters for Hadoop, Apache Spark, Apache Hive LLAP, Apache Kafka, and Apache HBase with enterprise security via Enterprise Security Package, autoscale, and integration with ADLS Gen2 — used mainly for migrating existing OSS big-data estates into Azure
Fully managed big-data platform running Apache Spark, Hive, HBase, Flink, Hadoop, and Kudu clusters with autoscaling, Kerberos security, and integration with OBS for compute-storage separation
Managed big-data platform running Apache Spark, Hadoop, Hive, HBase, Flink, Presto, and ClickHouse with autoscaling, Kerberos authentication, integration with COS for compute-storage separation, and Jupyter notebooks for interactive analysis