StravaDataPipline
airflow-docker
StravaDataPipline | airflow-docker | |
---|---|---|
1 | 1 | |
28 | 22 | |
- | - | |
6.0 | 7.0 | |
almost 2 years ago | 3 months ago | |
Python | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
StravaDataPipline
-
ELT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow
The GitHub repo can be found here: https://github.com/jackmleitch/StravaDataPipline A corresponding blog post can also be found here: https://jackmleitch.com/blog/Strava-Data-Pipeline
airflow-docker
-
Airflow Api tests
Clone the airflow-docker repo.
What are some alternatives?
Udacity-Data-Engineering-Projects - Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
soda-sql - Data profiling, testing, and monitoring for SQL accessible data.
audiophile-e2e-pipeline - Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard.
wsl-windows-toolbar-launcher - Adds linux GUI application menu to a windows toolbar
versatile-data-kit - One framework to develop, deploy and operate data workflows with Python and SQL.
superset - Apache Superset is a Data Visualization and Data Exploration Platform
spotify-api - Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped
nft-starter-kit - Timescale NFT Starter Kit
Skytrax-Data-Warehouse - A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
airflow-api-tests - This is a collection of Pytest for the 2.0 Stable Rest Apis for Apache Airflow. I have another repo where you could setup airflow locally and play around with these. I am used to RestAssured, but trying out pytest here.
portable-data-stack-dagster - A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
cargo-crates - An easy way to build data extractors in Docker.