Python apache-airflow

Open-source Python projects categorized as apache-airflow

Top 13 Python apache-airflow Projects

apache-airflow
  1. Airflow

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

    Project mention: From Zero to Job Data Visualization vs Power BI: Which Wins? | dev.to | 2026-05-07

    For senior engineers building custom job data visualization pipelines, the single biggest latency gain comes from pre-aggregating frequently accessed metrics instead of running joins at query time. In our benchmarks, querying raw job_postings tables with 1M rows took 210ms average, while pre-aggregated tables (updated hourly via PostgreSQL materialized views) reduced query time to 12ms. Use tools like Apache Airflow 2.7.3 to schedule materialized view refreshes during off-peak hours. For example, a materialized view for average salary by company can be defined as:

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. elyra

    Elyra extends JupyterLab with an AI centric approach.

  4. airflow-maintenance-dags

    A series of DAGs/Workflows to help maintain the operation of Airflow

  5. astronomer-cosmos

    Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code

  6. couler

    Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.

  7. ethereum-etl-airflow

    Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. How to get any Ethereum smart contract into BigQuery https://towardsdatascience.com/how-to-get-any-ethereum-smart-contract-into-bigquery-in-8-mins-bab5db1fdeee

  8. agents

    AI agent tooling for data engineering workflows. (by astronomer)

    Project mention: Agent Skills for Data Engineering (Airflow, Dbt, Analytics) | news.ycombinator.com | 2026-02-25
  9. airflow-chart

    A Helm chart to install Apache Airflow on Kubernetes

  10. MCP-Airflow-API

    ⚡ Control Apache Airflow with natural language via MCP. Chat with your workflows using Claude, GPT, or any LLM — no REST API calls needed. Supports Airflow 2.x (43 tools) & 3.0+ (45+ tools).

    Project mention: Show HN: MCP-Server for Control Airflow Cluster | news.ycombinator.com | 2025-08-16
  11. covid-19-data-engineering-pipeline

    A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via Github Actions.

  12. e2e-structured-streaming

    End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API, sends the data to Kafka, and processes it with Spark before writing to Cassandra. The pipeline, built with Python and Apache Zookeeper, is containerized with Docker for easy deployment and scalability.

  13. F2-Data-Pipeline

    Pipeline for Automated Updates of Kaggle's "Formula 2 Dataset"

  14. twitter_data-lakehouse_minio_drill_superset

    Building a Data Lakehouse for Analyzing Elon Musk Tweets using MinIO, Apache Airflow, Apache Drill and Apache Superset

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python apache-airflow discussion

Log in or Post with

Python apache-airflow related posts

Index

What are some of the best open-source apache-airflow projects in Python? This list will help you:

# Project Stars
1 Airflow 45,711
2 elyra 1,993
3 airflow-maintenance-dags 1,770
4 astronomer-cosmos 1,214
5 couler 941
6 ethereum-etl-airflow 441
7 agents 381
8 airflow-chart 297
9 MCP-Airflow-API 48
10 covid-19-data-engineering-pipeline 24
11 e2e-structured-streaming 21
12 F2-Data-Pipeline 10
13 twitter_data-lakehouse_minio_drill_superset 5

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you know that Python is
the 1st most popular programming language
based on number of references?