Top 3 google-cloud-dataproc Open-Source Projects
-
spark-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
-
initialization-actions
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Project mention: Dependency issue with Pyspark running on Kubernetes using spark-on-k8s-operator | /r/codehunter | 2023-05-31I have spent days now trying to figure out a dependency issue I'm experiencing with (Py)Spark running on Kubernetes. I'm using the spark-on-k8s-operator and Spark's Google Cloud connector.
Index
What are some of the best open-source google-cloud-dataproc projects? This list will help you:
Project | Stars | |
---|---|---|
1 | spark-operator | 2,613 |
2 | initialization-actions | 582 |
3 | spark-bigquery-connector | 351 |
Sponsored