Our great sponsors
-
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Apart from airflow, or luigi, if you want a YAML-driven approach with some http api triggers, take a look at tekton (http://tekton.dev) . It can define tasks (as containers) and pipelines of multiple tasks. Has a GUI monitor tool if desired as well.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- Is it impossible to contribute to open source as a data engineer?
- Building in Public: Leveraging Tublian's AI Copilot for My Open Source Contributions
- Navigating Week Two: Insights and Experiences from My Tublian Internship Journey
-
Airflow VS quix-streams - a user suggested alternative
2 projects | 7 Dec 2023
- Best ETL Tools And Why To Choose