Top 14 Python Dataflow Projects
-
marimo
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
Project mention: Pluto.jl 1.0 release – reactive notebook for Julia | news.ycombinator.com | 2026-06-03
Pluto is great. I use it all the time. If you like the reactivity/reproducibility but are wedded to Python, you might want to check out Marimo, which is also great. [https://marimo.io/]
It too puts the output of a cell above the code so if you're unable to adapt to things that are different it's also probably not for you.
FWIW, Observable's Notebooks (Javascript) work the same way: output above the code that produces it. [https://observablehq.com/]
I too did not like having the output above the code but got over it pretty quickly. For plots, it's arguably better: usually, I want to see the plot before I see the 15 line invocation of some plot command. The thing that bugs me the most about Pluto now is that it really wants you to only have a single evaluating statement per cell. You have to wrap stuff in "block......end" if you want to e.g. define more than one variable in a cell.
-
Project mention: Bytewax: Stream processing library built using Python and Rust | news.ycombinator.com | 2026-05-22
-
Project mention: Representing Python notebooks as dataflow graphs | news.ycombinator.com | 2025-08-09
Even with dataflow extensions (such as ipyflow [0]), I am still struggling with the execution model of notebooks in general. I often still see people defining functions and classes in notebooks to somehow handle prototyping loops.
I would love to see DAGs like those in the SSA form of compilers that also support loop operators. However, IMHO the notebook interface also needs to adjust for that (cell indentation?). The strength of notebooks shows rather in document authoring, as with Quarto, which IMHO mostly conflicts with more complex control flow.
[0] https://github.com/ipyflow/ipyflow
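To make the DAG idea in the comment above concrete, here is a minimal sketch of how notebook cells could be turned into a dataflow graph. It is not how ipyflow or marimo work internally; the helper names (`defs_and_uses`, `build_graph`, `run_order`) and the cell ids are hypothetical, and the analysis only tracks plain variable names (no imports, function definitions, or attribute mutation).

```python
import ast
from graphlib import TopologicalSorter


def defs_and_uses(source: str) -> tuple[set[str], set[str]]:
    """Return (names assigned, names read but not assigned) for one cell."""
    defined, used = set(), set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Name):
            if isinstance(node.ctx, ast.Store):
                defined.add(node.id)
            elif isinstance(node.ctx, ast.Load):
                used.add(node.id)
    # Simplification: a name both read and written in a cell counts as local.
    return defined, used - defined


def build_graph(cells: dict[str, str]) -> dict[str, set[str]]:
    """Map each cell id to the set of cell ids whose variables it reads."""
    info = {cid: defs_and_uses(src) for cid, src in cells.items()}
    owner: dict[str, str] = {}
    for cid, (defined, _) in info.items():
        for name in defined:
            # Assumption: each name is defined by a single cell;
            # with duplicates, the last cell seen wins.
            owner[name] = cid
    return {
        cid: {owner[n] for n in used if n in owner and owner[n] != cid}
        for cid, (_, used) in info.items()
    }


def run_order(cells: dict[str, str]) -> list[str]:
    """Order cells so that every cell runs after the cells it depends on."""
    return list(TopologicalSorter(build_graph(cells)).static_order())


if __name__ == "__main__":
    # Cells listed out of order, as they might appear on the page.
    cells = {"plot": "total = a + b", "a": "a = 1", "b": "b = a * 2"}
    print(build_graph(cells))
    print(run_order(cells))
```

If two cells depend on each other, `TopologicalSorter.static_order()` raises `graphlib.CycleError`, which hints at why loops are awkward to express in a pure DAG model like this one.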
-
prefect-deployment-patterns
Code examples showing flow deployment to various types of infrastructure
-
entangle
A lightweight (serverless) native python parallel processing framework based on simple decorators and call graphs.
Python Dataflow discussion
Python Dataflow related posts
-
Show HN: Marimo pair – reactive Python notebooks as environments for agents
-
Representing Python notebooks as dataflow graphs
-
It's 2025: Your Python Toolbox Is More Than Just PyCharm
-
pathway VS cocoindex - a user suggested alternative
2 projects | 1 Apr 2025
-
Can anyone tell if Xilinx's FINN (from Xilinx's research lab) is restricted for use only to xilinx based FPGAs?
-
flowsaber, a dataflow-based workflow package written in python
-
flowsaber, a dataflow-based workflow package written in python. It's extensible and has a highly intuitive composing syntax, with native shell task support. The whole flow is linked and composed from channels and tasks; different runs of a task with different inputs will be scheduled and run par
Index
What are some of the best open-source Dataflow projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | pathway | 63,006 |
| 2 | marimo | 21,378 |
| 3 | pyt | 2,206 |
| 4 | bytewax | 1,964 |
| 5 | ipyflow | 1,268 |
| 6 | pytm | 1,127 |
| 7 | finn | 1,007 |
| 8 | NIPY | 826 |
| 9 | prefect-deployment-patterns | 110 |
| 10 | entangle | 104 |
| 11 | flowsaber | 41 |
| 12 | krnel-graph | 22 |
| 13 | pglineage | 17 |
| 14 | m42pl-core | 4 |