incubator-gluten
FLaNK-EveryTransitSystem
incubator-gluten | FLaNK-EveryTransitSystem | |
---|---|---|
3 | 8 | |
988 | 3 | |
3.0% | - | |
9.9 | 6.1 | |
7 days ago | 5 months ago | |
Scala | ||
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
incubator-gluten
-
A glimpse into the future of data processing infrastructure.
When I first learned about the Gluten project from Intel, I thought Databricks was going to be in trouble.
- FLaNK Stack for 04 December 2023
-
Blaze: Fast query execution engine for Apache Spark
Interesting, looks like it is just DataFusion engine for Spark. There is a similar project: https://github.com/oap-project/gluten - it brings ClickHouse as an engine to Spark.
FLaNK-EveryTransitSystem
What are some alternatives?
LearningSparkV2 - This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
pgmq - A lightweight message queue. Like AWS SQS and RSMQ but on Postgres.
opaque-sql - An encrypted data analytics platform
StyleTTS2 - StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
blaze - Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
imgbeddings - Python package to generate image embeddings with CLIP without PyTorch/TensorFlow
blaze - NumPy and Pandas interface to Big Data
ComfyUI - The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
Jupyter Scala - A Scala kernel for Jupyter
ML-For-Beginners - 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
kyuubi - Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Scada-LTS - Scada-LTS is an Open Source, web-based, multi-platform solution for building your own SCADA (Supervisory Control and Data Acquisition) system.