Python Stream Processing

Open-source Python projects categorized as Stream Processing

Top 18 Python Stream Processing Projects

Stream Processing
  1. pathway

    Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

    Project mention: pathway VS cocoindex - a user suggested alternative | libhunt.com/r/pathway | 2025-04-01
  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. Faust

    Python Stream Processing

  4. river

    🌊 Online machine learning in Python

  5. faststream

    FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.

    Project mention: FastStream: A powerful library for building services with event streams | news.ycombinator.com | 2024-10-29

    FastStream (https://github.com/airtai/faststream) simplifies the process of writing producers and consumers for message queues, handling all the parsing, networking and documentation generation automatically. It is a new package based on the ideas and experiences gained from FastKafka and Propan. By joining our forces, we picked up the best from both packages and created a unified way to write services capable of processing streamed data regardless of the underlying protocol. We'll continue to maintain both packages, but new development will be in this project.

    Making streaming microservices has never been easier. Designed with junior developers in mind, FastStream simplifies your work while keeping the door open for more advanced use cases. Here's a look at the core features that make FastStream a go-to framework for modern, data-centric microservices.

    Multiple Brokers: FastStream provides a unified API to work across multiple message brokers (Apache Kafka, RabbitMQ, NATS and Redis)

  6. bytewax

    Python Stream Processing

    Project mention: Roast my new Python library for stream processing | news.ycombinator.com | 2025-04-03

    Interesting! How do you see this comparing with Bytewax - https://github.com/bytewax/bytewax

  7. quix-streams

    A Python library for building containerized ML and Generative AI applications with Apache Kafka.

    Project mention: Show HN: Denormalized – Embeddable Stream Processing in Rust and DataFusion | news.ycombinator.com | 2024-08-15

    Congratulations on launching your project! We spoke back in March at a Kafka Summit London social meetup and talked all things Python and Kafka (I work on https://github.com/quixio/quix-streams). Always great to see a new stream processing project tackle a new segment

  8. radio-active

    Play any radios around the globe right from the terminal :zap:

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. csp

    csp is a high performance reactive stream processing library, written in C++ and Python

    Project mention: Csp: Reactive stream processing in C++ and Python | news.ycombinator.com | 2025-03-14
  11. zef

    Toolkit for graph-relational data across space and time (by zefhub)

  12. kafkaml-anomaly-detection

    Project for real-time anomaly detection using Kafka and python

  13. volga

    Real-time data processing/feature engineering in Python. Tailored for modern AI/ML systems.

  14. gretel-python-client

    The Gretel Python Client allows you to interact with the Gretel REST API.

  15. arroyo

    A library to build streaming applications that consume from and produce to Kafka. (by getsentry)

  16. makinage

    Stream Processing Made Easy

  17. beavers

    Python stream processing for analytics

    Project mention: Build a real-time crypto analytics dashboard with Beavers and Perspective | dev.to | 2024-07-25

    This post shows how you can leverage two powerful python libraries, Beavers and Perspective, to analyse data in realtime and display it in a dashboard.

  18. rxsci

    ReactiveX for data science

  19. example-real-time-ai-alerts

    Real-Time Alerts with AI, NATs and Streamlit

    Project mention: How to Build a Streaming Deduplication Pipeline with Kafka, GlassFlow, and ClickHouse | dev.to | 2025-05-14

    GlassFlow: Our processing engine. GlassFlow will read data from Kafka, check for duplicates, and write to ClickHouse.

  20. pipelines

    Create Async Processing Pipelines Quick! (by theboxahaan)

  21. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Stream Processing discussion

Log in or Post with

Python Stream Processing related posts

  • Roast my new Python library for stream processing

    1 project | news.ycombinator.com | 3 Apr 2025
  • Show HN: Bytewax, JVM free stream processing Py+Rust

    1 project | news.ycombinator.com | 17 Oct 2024
  • Show HN: Streaming DataFrames–a Pandas-like syntax for real-time data

    1 project | news.ycombinator.com | 23 Apr 2024
  • Building a streaming SQL engine with Arrow and DataFusion

    1 project | news.ycombinator.com | 18 Mar 2024
  • FLaNK AI Weekly 18 March 2024

    39 projects | dev.to | 18 Mar 2024
  • Airflow VS quix-streams - a user suggested alternative

    2 projects | 7 Dec 2023
  • faust VS quix-streams - a user suggested alternative

    2 projects | 7 Dec 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 23 May 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Stream Processing projects in Python? This list will help you:

# Project Stars
1 pathway 25,345
2 Faust 6,791
3 river 5,338
4 faststream 3,931
5 bytewax 1,739
6 quix-streams 1,377
7 radio-active 535
8 csp 292
9 zef 114
10 kafkaml-anomaly-detection 59
11 volga 58
12 gretel-python-client 56
13 arroyo 52
14 makinage 40
15 beavers 38
16 rxsci 14
17 example-real-time-ai-alerts 9
18 pipelines 4

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?