Python Pipelines

Open-source Python projects categorized as Pipelines

Top 20 Python Pipeline Projects

  1. Taipy

    Turns Data and AI algorithms into production-ready web applications in no time.

    Project mention: Top 40 Open-source Developer Tools with the Most GitHub Stars | dev.to | 2025-04-20

    GitHub: https://github.com/Avaiga/taipy

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. Mage

    πŸ§™ The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai

    Project mention: Wk 3 Orchestration: MLOPs with DataTalks | dev.to | 2025-02-22

    Here, we use the free Mage Ai orchestration tool.

  4. zenml

    ZenML πŸ™: The bridge between ML and Ops. https://zenml.io.

    Project mention: Accelerating ML Development with DevPods and ModelKits | dev.to | 2025-01-28

    Seamless integration: Works with OCI-compliant registries (e.g., Docker Hub and Jozu Hub) and integrates with popular tools like HuggingFace, ZenML, and Git.

  5. polyaxon

    MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle

  6. ploomber

    The fastest ⚑️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

  7. meltano

    Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

  8. elyra

    Elyra extends JupyterLab with an AI centric approach.

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. covalent

    Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments. (by AgnostiqHQ)

  11. azure-devops-cli-extension

    Azure DevOps Extension for Azure CLI

  12. pypyr automation task runner

    pypyr task-runner cli & api for automation pipelines. Automate anything by combining commands, different scripts in different languages & applications into one pipeline process.

  13. pipefunc

    Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows πŸ•ΈοΈπŸ§ͺ

    Project mention: PipeFunc: Ultra Simple DAG Pipelines in Python with 15Β΅s Overhead for Science | news.ycombinator.com | 2024-12-22

    β€’ Any scenario involving interconnected functions where performance and ease of use are important

    I'd appreciate any feedback, especially regarding performance, usability, and potential applications in different scientific domains.

    Links:

    Documentation: https://pipefunc.readthedocs.io

  14. sparktorch

    Train and run Pytorch models on Apache Spark.

  15. patterns-devkit

    Data pipelines from re-usable components

  16. xontrib-pipeliner

    Let your pipe lines flow thru the Python code in xonsh.

  17. pipeline-runner

    Tool to run Bitbucket pipelines locally

  18. SmartPipeline

    A framework for rapid development of robust data pipelines following a simple design pattern

  19. panda_patrol

  20. orinoco

    Functional composable pipelines allowing clean separation of the business logic and its implementation

  21. pipelines

    Create Async Processing Pipelines Quick! (by theboxahaan)

  22. m42pl-core

    A data manipulation language with a focus on flexibility and simplicity.

  23. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Pipelines discussion

Log in or Post with

Python Pipelines related posts

  • PipeFunc: Ultra Simple DAG Pipelines in Python with 15Β΅s Overhead for Science

    3 projects | news.ycombinator.com | 22 Dec 2024
  • This Week In Python

    5 projects | dev.to | 11 Oct 2024
  • Pipefunc: Minimalist DAG-Based Pipeline Management in Pure Python

    1 project | news.ycombinator.com | 11 Sep 2024
  • PipelineC Example: FM Radio Demodulation (FPGA SDR)

    2 projects | news.ycombinator.com | 3 Mar 2024
  • Generate non-CPU FPGA circuits from a C-like language

    1 project | news.ycombinator.com | 24 Nov 2023
  • What makes C, Verilog, Java, Python, etc. so different?

    1 project | /r/ECE | 8 Jun 2023
  • What are your private FPGA projects and why?

    2 projects | /r/FPGA | 31 May 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 12 May 2025
    SaaSHub helps you find the best software and product alternatives Learn more β†’

Index

What are some of the best open-source Pipeline projects in Python? This list will help you:

# Project Stars
1 Taipy 18,047
2 Mage 8,301
3 zenml 4,571
4 polyaxon 3,634
5 ploomber 3,569
6 meltano 2,050
7 elyra 1,918
8 covalent 826
9 azure-devops-cli-extension 650
10 pypyr automation task runner 627
11 pipefunc 357
12 sparktorch 339
13 patterns-devkit 108
14 xontrib-pipeliner 59
15 pipeline-runner 55
16 SmartPipeline 27
17 panda_patrol 21
18 orinoco 11
19 pipelines 4
20 m42pl-core 4

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?