ETL with python

This page summarizes the projects mentioned and recommended in the original post on /r/ETL

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • airflow-docker

    Source code of the Apache Airflow Tutorial for Beginners on YouTube Channel Coder2j (https://www.youtube.com/c/coder2j) (by coder2j)

  • You can watch my Apache Airflow for Beginner Tutorial Series playlist on YouTube. If you think it is helpful, consider subscribing to my youtube channel and star my GitHub repository. Comment what topics you want to see or discuss about Airflow in the next episode.

  • ploomber

    The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

  • I recommend using Ploomber which can help you build once and automate a lot of the work, and it works with python natively. It's open source so you can start with one of the examples, like the ML-basic example or the ETL one. It'll allow you to define the pipeline and then easily explain the flow with the DAG plot. Feel free to ask questions, I'm happy to help (I've built 100s of data pipelines over the years).

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • projects

    Sample projects using Ploomber. (by ploomber)

  • I recommend using Ploomber which can help you build once and automate a lot of the work, and it works with python natively. It's open source so you can start with one of the examples, like the ML-basic example or the ETL one. It'll allow you to define the pipeline and then easily explain the flow with the DAG plot. Feel free to ask questions, I'm happy to help (I've built 100s of data pipelines over the years).

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • The Unbundling of Airflow

    3 projects | news.ycombinator.com | 15 Feb 2022
  • Show HN: JupySQL – a SQL client for Jupyter (ipython-SQL successor)

    2 projects | news.ycombinator.com | 6 Dec 2023
  • New to large SW projects in Python, best practices to organize code

    1 project | /r/Python | 11 Nov 2022
  • A three-part series on deploying a Data Science Platform on AWS

    1 project | /r/dataengineering | 4 Nov 2022
  • Ploomber Cloud - Parametrizing and running notebooks in the cloud in parallel

    3 projects | /r/IPython | 3 Nov 2022