Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more! Learn more →
Top 23 Python Distributed Computing Projects
-
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning. (by pytorch)
-
fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
-
-
vizier
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
-
-
Sevalla
Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
-
couler
Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.
-
-
-
tdigest
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark (by CamDavidsonPilon)
-
machinaris
An easy-to-use WebUI for crypto plotting and farming. Offers Bladebit, Gigahorse, MadMax, Chiadog and Plotman in a Docker container. Supports Chia, MMX, Chives, Flax, and HDDCoin among others.
-
-
-
stable-diffusion-webui-distributed
Chains stable-diffusion-webui instances together to facilitate faster image generation.
-
-
mlToolKits
learningOrchestra is a distributed Machine Learning integration tool that facilitates and streamlines iterative processes in a Data Science project.
-
redis-dict
Python dictionary with Redis as backend, built for large datasets. Simplifies Redis operations for large-scale and distributed systems. Supports various data types, namespacing, pipelining, and expiration.
It handles types without Pickle since remote pickled data is unsafe. Built for working with large datasets, it implements the full dictionary interface with extensive test coverage.
GitHub: https://github.com/Attumm/redis-dict
-
-
FindTheMag2
A tool to determine optimal projects for Gridcoin & BOINC crunchers. Maximize your magnitude!
-
-
py-inventa
A Python library for microservice registry and executing RPC (Remote Procedure Call) over Redis.
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Distributed Computing discussion
Python Distributed Computing related posts
-
Show HN: RedisDict
-
Show HN: Interactive Graph by LLM (GPT-4o)
-
Daft: A High-Performance Distributed Dataframe Library for Multimodal Data
-
about making a game.
-
TIL : about the game "Foldit", a puzzle game about protein folding. In 2011, its gamers helped decipher a protein of a HIV-like virus, solving a scientific problem that went unsolved for 15 years in as little as 10 days.
-
Alternatives to Kaggle and Collab?
-
Shuffling large data at constant memory in Dask
-
A note from our sponsor - Sevalla
sevalla.com | 2 Sep 2025
Index
What are some of the best open-source Distributed Computing projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | ColossalAI | 41,121 |
2 | catalyst | 3,354 |
3 | rl | 3,025 |
4 | fugue | 2,105 |
5 | distributed | 1,648 |
6 | vizier | 1,592 |
7 | AI-Horde | 1,272 |
8 | couler | 940 |
9 | bagua | 882 |
10 | openfederatedlearning | 800 |
11 | tdigest | 398 |
12 | machinaris | 344 |
13 | sparktorch | 339 |
14 | arkouda | 270 |
15 | stable-diffusion-webui-distributed | 183 |
16 | wrapyfi | 77 |
17 | mlToolKits | 76 |
18 | redis-dict | 74 |
19 | tune | 35 |
20 | FindTheMag2 | 33 |
21 | rxray | 12 |
22 | py-inventa | 9 |
23 | hulse-py | 6 |