cuelake
Local-Data-LakeHouse
Our great sponsors
cuelake | Local-Data-LakeHouse | |
---|---|---|
2 | 1 | |
284 | 43 | |
0.0% | - | |
0.0 | 4.4 | |
almost 2 years ago | 8 months ago | |
JavaScript | Dockerfile | |
Apache License 2.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cuelake
-
We have open-sourced Cuelake
Check out Cuelake at https://github.com/cuebook/cuelake. Feedback is appreciated.
-
Feedback for open source data engineering tool Cuelake (similar to data bricks)
We have been working on this open source project since 2 months, needed some feedback from data engineers so posting it here. In current state Cuelake can be installed on Kubernetes cluster and can run spark and python code via zeppelin notebooks and these notebooks can be scheduled too. You can check out the repo here: https://github.com/cuebook/cuelake
Local-Data-LakeHouse
-
Project showcase: sample Data Lakehouse
Here is the Github repo: https://github.com/dominikhei/Local-Data-LakeHouse
What are some alternatives?
kyuubi - Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
matano - Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS
kedro-viz - Visualise your Kedro data and machine-learning pipelines and track your experiments.
incubator-xtable - Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
go-streams - A lightweight stream processing library for Go
minio-dokku - Dockerfile to run Minio (S3 compatible storage) on Dokku (mini-Heroku)
react-csv - React components to build CSV files on the fly basing on Array/literal object of data
qbeast-spark - Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!
airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
hive-metastore - Apache Hive Metastore as a Standalone server in Docker
cakephp-dto - CakePHP DTO plugin - quickly generate useful data transfer objects for your app (mutable/immutable)
Rudderstack - Privacy and Security focused Segment-alternative, in Golang and React