Python package to build ETL flows/DAGs

This page summarizes the projects mentioned and recommended in the original post on /r/Python

  • flowrunner

    Flowrunner is a lightweight package for organizing and representing data engineering and data science workflows.

  • Here is an example of a PySpark flow: https://github.com/prithvijitguha/flowrunner/blob/main/examples/pyspark_example.py

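To make the idea of an ETL flow organized as a DAG concrete, here is a minimal sketch in plain Python. It does not use flowrunner and does not reflect its API; the step functions, the `STEPS` and `DEPENDS_ON` mappings, and `run_flow` are hypothetical names chosen for illustration. Only the standard-library `graphlib` module is assumed.

```python
from graphlib import TopologicalSorter


# Hypothetical steps of a tiny ETL flow (illustration only, not flowrunner's API).
def extract():
    return [1, 2, 3]


def transform(rows):
    return [r * 10 for r in rows]


def load(rows):
    return {"loaded": len(rows), "total": sum(rows)}


# Each step name maps to its function and to the set of steps it depends on.
STEPS = {"extract": extract, "transform": transform, "load": load}
DEPENDS_ON = {"extract": set(), "transform": {"extract"}, "load": {"transform"}}


def run_flow(steps, depends_on):
    """Run every step after its dependencies, passing their results as arguments."""
    results = {}
    for name in TopologicalSorter(depends_on).static_order():
        inputs = [results[dep] for dep in sorted(depends_on[name])]
        results[name] = steps[name](*inputs)
    return results


results = run_flow(STEPS, DEPENDS_ON)
```

Keeping the dependency graph separate from the step functions means the execution order is derived from the graph rather than hard-coded, which is the general pattern that flow and DAG packages build on.
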
NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Show HN: Hamilton's UI – observability, lineage, and catalog for data pipelines

    1 project | news.ycombinator.com | 2 May 2024
  • Prefect: A workflow orchestration tool for data pipelines

    1 project | news.ycombinator.com | 13 Mar 2024
  • Using IPython Jupyter Magic commands to improve the notebook experience

    1 project | dev.to | 3 Mar 2024
  • Daft: Distributed DataFrame for Python

    2 projects | news.ycombinator.com | 29 Feb 2024
  • Show HN: Hacker News AI built using function calling

    1 project | news.ycombinator.com | 28 Jan 2024