ELT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

StravaDataPipline

1 28 6.0 Python

:arrows_counterclockwise: :running: EtLT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow

The GitHub repo can be found here: https://github.com/jackmleitch/StravaDataPipline A corresponding blog post can also be found here: https://jackmleitch.com/blog/Strava-Data-Pipeline

versatile-data-kit

52 410 9.7 Python

One framework to develop, deploy and operate data workflows with Python and SQL.

I believe that you would not need to build the docker image yourself. There are data engineering frameworks which allow you to build your data jobs yourself and take care of the containerisation of your pipeline. You can have a look at this ingest from rest API example. They would also allow you to schedule your data job using cron, while data job itself can contain SQL & Python.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Show HN: Use function calling to build AI Assistants
1 project | news.ycombinator.com | 27 Feb 2024
Phidata: Build AI Assistants using function calling
1 project | news.ycombinator.com | 25 Feb 2024
Chat with ArXiv Papers
2 projects | news.ycombinator.com | 5 Feb 2024
Chat with PDFs using function calling
2 projects | news.ycombinator.com | 2 Feb 2024
Celebrating my first Data Engineering Project -- Fitbit data with PySpark, GCP, prefect, and terraform!
4 projects | /r/dataengineering | 12 Oct 2022

ELT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering
Python AWS aws-s3 aws-redshift data-engineering
Post date: 24 Jun 2022

StravaDataPipline

versatile-data-kit

WorkOS

Related posts

ELT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering Python AWS aws-s3 aws-redshift data-engineering Post date: 24 Jun 2022

StravaDataPipline

versatile-data-kit

WorkOS

Related posts

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering
Python AWS aws-s3 aws-redshift data-engineering
Post date: 24 Jun 2022