Spark-The-Definitive-Guide
LearningSparkV2
Spark-The-Definitive-Guide | LearningSparkV2 | |
---|---|---|
1 | 1 | |
2,734 | 1,095 | |
1.4% | 3.3% | |
10.0 | 0.0 | |
over 3 years ago | over 1 year ago | |
Scala | Scala | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Spark-The-Definitive-Guide
-
datadelivery: Providing public datasets to explore in AWS
Spark - The Definitive Guide
LearningSparkV2
-
datadelivery: Providing public datasets to explore in AWS
Learning Spark
What are some alternatives?
datadelivery - A Terraform module that provides an efficient way to activate pieces and services in an AWS account in order to enable users to explore preselected public datasets.
incubator-gluten - Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Apache-Hive-Essentials-Second-Edition - Apache Hive Essentials, Second Edition published by Packt
kyuubi - Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Read the Docs - The source code that powers readthedocs.org
delta-sharing - An open protocol for secure data sharing
s3-sqs-connector - A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).
delta - An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs