Udacity-Data-Engineering-Projects
StravaDataPipline
Udacity-Data-Engineering-Projects | StravaDataPipline | |
---|---|---|
5 | 1 | |
1,295 | 28 | |
- | - | |
0.0 | 6.0 | |
over 1 year ago | almost 2 years ago | |
Python | Python | |
GNU General Public License v3.0 or later | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Udacity-Data-Engineering-Projects
- Pitanje za data engineering?
-
✨ 5 Free Resources to Learn Data Engineering 🚀
🔗 https://github.com/san089/Udacity-Data-Engineering-Projects
-
How can I become a big data engineer?
You can start with googling data engineering learning path to get a sense of what you need to know. If you are looking for simple projects to start with then you can look at this as well (https://github.com/san089/Udacity-Data-Engineering-Projects).
-
Beginner DE projects.
For practice, Data Modeling with Postgres and Udacity Data Engineering Projects as examples, and Data Engineering Project for Beginners - Batch edition for a guided tutorial.
- Data Pipeline Examples in Action
StravaDataPipline
-
ELT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow
The GitHub repo can be found here: https://github.com/jackmleitch/StravaDataPipline A corresponding blog post can also be found here: https://jackmleitch.com/blog/Strava-Data-Pipeline
What are some alternatives?
hydra - Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
airflow-docker - This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and workflows.
data-engineering-zoomcamp - Free Data Engineering course!
audiophile-e2e-pipeline - Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard.
data-engineering-book - Accumulated knowledge and experience in the field of Data Engineering
versatile-data-kit - One framework to develop, deploy and operate data workflows with Python and SQL.
ask-astro - An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer
spotify-api - Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped
pg-counter-metrics - PG Counter Metrics ( PGCM ) is a tool for publishing PostgreSQL performance data to CloudWatch. By publishing to CloudWatch, dashboards and alarming can be used on the collected data.
Skytrax-Data-Warehouse - A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
canarypy - CanaryPy - A light and powerful canary release for Data Pipelines
Data-Engineering-Projects - Personal Data Engineering Projects