incubator-gluten
LearningSparkV2
incubator-gluten | LearningSparkV2 | |
---|---|---|
3 | 1 | |
988 | 1,095 | |
3.0% | 3.3% | |
9.9 | 0.0 | |
7 days ago | over 1 year ago | |
Scala | Scala | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
incubator-gluten
-
A glimpse into the future of data processing infrastructure.
When I first learned about the Gluten project from Intel, I thought Databricks was going to be in trouble.
- FLaNK Stack for 04 December 2023
-
Blaze: Fast query execution engine for Apache Spark
Interesting, looks like it is just DataFusion engine for Spark. There is a similar project: https://github.com/oap-project/gluten - it brings ClickHouse as an engine to Spark.
LearningSparkV2
-
datadelivery: Providing public datasets to explore in AWS
Learning Spark
What are some alternatives?
opaque-sql - An encrypted data analytics platform
kyuubi - Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
blaze - Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Spark-The-Definitive-Guide - Spark: The Definitive Guide's Code Repository
blaze - NumPy and Pandas interface to Big Data
delta-sharing - An open protocol for secure data sharing
Jupyter Scala - A Scala kernel for Jupyter
datadelivery - A Terraform module that provides an efficient way to activate pieces and services in an AWS account in order to enable users to explore preselected public datasets.
s3-sqs-connector - A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).
narrator - David Attenborough narrates your life
Apache-Hive-Essentials-Second-Edition - Apache Hive Essentials, Second Edition published by Packt