Top 6 Python data-engineering-pipeline Projects
-
Udacity-Data-Engineering-Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
-
Here's the project: https://github.com/vmware/versatile-data-kit
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
prefect-deployment-patterns
Code examples showing flow deployment to various types of infrastructure
-
-
Project mention: Created my first Data Engineering Project which integrates F1 data using Prefect, Terraform, dbt, BigQuery and Looker Studio | /r/dataengineering | 2023-07-01
Github
-
business_closures_de_pipeline
Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database
NOTE:
The open source projects on this list are ordered by number of github stars.
The number of mentions indicates repo mentiontions in the last 12 Months or
since we started tracking (Dec 2020).
The latest post mention was on 2023-07-01.
Python data-engineering-pipeline related posts
Index
What are some of the best open-source data-engineering-pipeline projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | Udacity-Data-Engineering-Projects | 1,295 |
2 | versatile-data-kit | 406 |
3 | prefect-deployment-patterns | 87 |
4 | Apache-Spark-Guide | 26 |
5 | f1-data-pipeline | 23 |
6 | business_closures_de_pipeline | 13 |
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com