google-cloud-dataproc

Open-source projects categorized as google-cloud-dataproc
Language: + Go + Shell + Java

Top 3 google-cloud-dataproc Open-Source Projects

  • spark-operator

    Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

  • Project mention: Dependency issue with Pyspark running on Kubernetes using spark-on-k8s-operator | /r/codehunter | 2023-05-31

    I have spent days now trying to figure out a dependency issue I'm experiencing with (Py)Spark running on Kubernetes. I'm using the spark-on-k8s-operator and Spark's Google Cloud connector.

  • initialization-actions

    Run in all nodes of your cluster before the cluster starts - lets you customize your cluster

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • spark-bigquery-connector

    BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Index

What are some of the best open-source google-cloud-dataproc projects? This list will help you:

Project Stars
1 spark-operator 2,613
2 initialization-actions 582
3 spark-bigquery-connector 351

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com