airflow-docker
StravaDataPipline
airflow-docker | StravaDataPipline | |
---|---|---|
1 | 1 | |
21 | 28 | |
- | - | |
7.0 | 6.0 | |
3 months ago | almost 2 years ago | |
Python | Python | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
airflow-docker
-
Airflow Api tests
Clone the airflow-docker repo.
StravaDataPipline
-
ELT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow
The GitHub repo can be found here: https://github.com/jackmleitch/StravaDataPipline A corresponding blog post can also be found here: https://jackmleitch.com/blog/Strava-Data-Pipeline
What are some alternatives?
soda-sql - Data profiling, testing, and monitoring for SQL accessible data.
Udacity-Data-Engineering-Projects - Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
wsl-windows-toolbar-launcher - Adds linux GUI application menu to a windows toolbar
audiophile-e2e-pipeline - Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard.
superset - Apache Superset is a Data Visualization and Data Exploration Platform
versatile-data-kit - One framework to develop, deploy and operate data workflows with Python and SQL.
nft-starter-kit - Timescale NFT Starter Kit
spotify-api - Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped
airflow-api-tests - This is a collection of Pytest for the 2.0 Stable Rest Apis for Apache Airflow. I have another repo where you could setup airflow locally and play around with these. I am used to RestAssured, but trying out pytest here.
Skytrax-Data-Warehouse - A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
portable-data-stack-dagster - A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
cargo-crates - An easy way to build data extractors in Docker.