kyuubi vs cuelake
Metric | kyuubi | cuelake
---|---|---
Mentions | 1 | 2
Stars | 1,936 | 284
Growth | 1.5% | 0.0%
Activity | 9.8 | 0.0
Last commit | 4 days ago | almost 2 years ago
Language | Scala | JavaScript
License | Apache License 2.0 | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
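The two derived metrics described above can be sketched in a few lines. This is only an illustration, not the site's actual formula: the function names are hypothetical, the growth calculation assumes a plain percentage change between two monthly star snapshots, and the linear mapping from activity score to "top N%" is an assumption extrapolated from the single 9.0 example.

```python
def month_over_month_growth(stars_last_month: int, stars_now: int) -> float:
    """Percentage change in stars between two monthly snapshots (assumed definition)."""
    if stars_last_month <= 0:
        raise ValueError("a positive baseline star count is required")
    return (stars_now - stars_last_month) / stars_last_month * 100.0


def top_share_from_activity(activity: float) -> float:
    """Assumed linear reading of a 0-10 activity score.

    Only the 9.0 -> top 10% point is stated in the text; every other
    value returned here is an extrapolation.
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity is expected to lie between 0 and 10")
    return (10.0 - activity) * 10.0


if __name__ == "__main__":
    # Hypothetical snapshot values, chosen only to exercise the functions.
    print(f"growth: {month_over_month_growth(200, 203):.1f}%")
    print(f"activity 9.0 -> top {top_share_from_activity(9.0):.0f}%")
```

Under these assumptions, a growth of 0.0% simply means the star count did not change between the two snapshots.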
kyuubi
cuelake
- We have open-sourced Cuelake
  Check out Cuelake at https://github.com/cuebook/cuelake. Feedback is appreciated.
- Feedback for open source data engineering tool Cuelake (similar to Databricks)
  We have been working on this open source project for 2 months and need feedback from data engineers, so we are posting it here. In its current state, Cuelake can be installed on a Kubernetes cluster and can run Spark and Python code via Zeppelin notebooks, and these notebooks can be scheduled too. You can check out the repo here: https://github.com/cuebook/cuelake
What are some alternatives?
Trino - Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
kedro-viz - Visualise your Kedro data and machine-learning pipelines and track your experiments.
presto - Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io) [Moved to: https://github.com/trinodb/trino]
go-streams - A lightweight stream processing library for Go
incubator-livy - Mirror of Apache Livy (Incubating)
react-csv - React components to build CSV files on the fly based on Array/literal object of data
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
opaque-sql - An encrypted data analytics platform
cakephp-dto - CakePHP DTO plugin - quickly generate useful data transfer objects for your app (mutable/immutable)
zio-protoquill - Quill for Scala 3
Mage - 🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai