Top 7 data-engineering-pipeline Open-Source Projects
-
Udacity-Data-Engineering-Projects
A few projects related to data engineering, including data modeling, cloud infrastructure setup, data warehousing, and data lake development.
-
prefect-deployment-patterns
Code examples showing flow deployment to various types of infrastructure
-
business_closures_de_pipeline
Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database
-
Shift
Shift is a high-performance alternative to Airbyte, Singer, and Meltano (by piyushsingariya)
Here's the project: https://github.com/vmware/versatile-data-kit
Project mention: Created my first Data Engineering Project which integrates F1 data using Prefect, Terraform, dbt, BigQuery and Looker Studio | /r/dataengineering | 2023-07-01
As a side hobby, I started working on this personal project: https://github.com/piyushsingariya/Kaku
Index
What are some of the best open-source data-engineering-pipeline projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | Udacity-Data-Engineering-Projects | 1,295 |
2 | versatile-data-kit | 411 |
3 | prefect-deployment-patterns | 93 |
4 | Apache-Spark-Guide | 28 |
5 | f1-data-pipeline | 23 |
6 | business_closures_de_pipeline | 14 |
7 | Shift | 9 |