Top 10 Python Dataflow Projects
-
entangle
A lightweight (serverless) native python parallel processing framework based on simple decorators and call graphs.
-
prefect-deployment-patterns
Code examples showing flow deployment to various types of infrastructure
Project mention: Building a streaming SQL engine with Arrow and DataFusion | news.ycombinator.com | 2024-03-18
Project mention: Show HN: Marimo – an open-source reactive notebook for Python | news.ycombinator.com | 2024-01-12
You're probably referring to nbgather (https://github.com/microsoft/gather), which shipped with VSCode for a while.
nbgather used static slicing to get all the code necessary to reconstruct some cell. I actually worked with Andrew Head (original nbgather author) and Shreya Shankar to implement something similar in ipyflow (but with dynamic slicing and a not-as-nice interface): https://github.com/ipyflow/ipyflow?tab=readme-ov-file#state-...
I have no doubt something like this will make its way into marimo's roadmap at some point :)
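To make the "static slicing" idea in the comment above concrete, here is a minimal sketch of a backward slice over notebook cells. It is not nbgather's or ipyflow's actual implementation: the function names `names_defined_and_used` and `backward_slice` are hypothetical, and the analysis is a coarse, name-based approximation that only looks at which names each cell assigns and reads.

```python
import ast


def names_defined_and_used(source):
    """Return (defined, used) name sets for one cell's source (coarse, name-based)."""
    defined, used = set(), set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Name):
            if isinstance(node.ctx, ast.Store):
                defined.add(node.id)
            else:
                used.add(node.id)
        elif isinstance(node, (ast.FunctionDef, ast.ClassDef)):
            defined.add(node.name)
        elif isinstance(node, ast.alias):
            # `import a.b` binds `a`; `import a as c` binds `c`.
            defined.add((node.asname or node.name).split(".")[0])
    return defined, used


def backward_slice(cells, target):
    """Return indices of the cells needed to recompute cells[target], in order."""
    needed = {target}
    _, wanted = names_defined_and_used(cells[target])
    # Walk backwards: keep a cell if it defines a name we still need,
    # then start needing whatever that cell itself reads.
    for i in range(target - 1, -1, -1):
        defined, used = names_defined_and_used(cells[i])
        if defined & wanted:
            needed.add(i)
            wanted = (wanted - defined) | used
    return sorted(needed)


# Hypothetical notebook: cell 2 (`y = 10`) is irrelevant to the last cell.
cells = ["import math", "x = 2", "y = 10", "z = math.sqrt(x)", "print(z)"]
print(backward_slice(cells, 4))  # → [0, 1, 3, 4]
```

Because the sketch never runs the cells, it over-approximates: any earlier cell that assigns a needed name is kept, whether or not that assignment is the one that actually reached the target at run time. A dynamic slicer, as described for ipyflow above, would instead record dependencies during execution.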
Project mention: Hi, What could be the best HLS tool for implementing neural networks on FPGA | /r/FPGA | 2023-06-13
FINN - https://github.com/Xilinx/finn
Python Dataflow related posts
- Can anyone tell if Xilinx's FINN (from Xilinx's research lab) is restricted for use only to xilinx based FPGAs?
- flowsaber, a dataflow-based workflow package written in Python. It's extensible and has a highly intuitive composing syntax, with native shell task support. The whole flow is linked and composed from channels and tasks; different runs of a task with different inputs will be scheduled and run par
Index
What are some of the best open-source Dataflow projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | pyt | 2,161 |
2 | bytewax | 1,144 |
3 | ipyflow | 1,073 |
4 | pytm | 836 |
5 | NIPY | 731 |
6 | finn | 661 |
7 | entangle | 105 |
8 | prefect-deployment-patterns | 93 |
9 | flowsaber | 40 |
10 | m42pl-core | 4 |