Top 23 Distributed Computing Open-Source Projects
-
oceanbase
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
-
Hazelcast
Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.
-
Akka.net
Canonical actor model implementation for .NET with local + distributed actors in C# and F#.
-
gleam
Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or distributedly. (by chrislusf)
-
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
-
fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
-
MooseFS
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
-
zenoh
zenoh unifies data in motion, data in-use, data at rest and computations. It carefully blends traditional pub/sub with geo-distributed storages, queries and computations, while retaining a level of time and space efficiency that is well beyond any of the mainstream stacks.
-
vizier
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
-
SmartSql
SmartSql = MyBatis in C# + .NET Core + Cache (Memory | Redis) + R/W Splitting + PropertyChangedTrack + Dynamic Repository + InvokeSync + Diagnostics
Project mention: Show HN: OceanBase – An open-source distributed SQL database written in C++ | news.ycombinator.com | 2023-05-23
Project mention: Does anyone know any good java implementations for distributed key-value store? | /r/ExperiencedDevs | 2023-06-08
You're probably looking for Hazelcast here. Note that it does much more than just a distributed k/v, but it will get you where you need to go.
Project mention: Is there a programming language that will blow my mind? | /r/ProgrammingLanguages | 2023-06-01
https://github.com/asynkron/protoactor-go is a great lib that implements an Erlang/Akka-like actor model in Go.
akka.net actors. Actors all the way! https://getakka.net
Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09
Project mention: GreptimeAI + Xinference - Efficient Deployment and Monitoring of Your LLM Applications | dev.to | 2024-01-24
Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models. With Xinference, you’re empowered to run inference using any open-source LLMs, embedding models, and multimodal models either in the cloud or on your own premises, and create robust AI-driven applications. It provides a RESTful API compatible with OpenAI API, Python SDK, CLI, and WebUI. Furthermore, it integrates third-party developer tools like LangChain, LlamaIndex, and Dify, facilitating model integration and development.
The only way I can foresee a cryptocoin actually holding value is if spending the coin meant spending processing cycles and RAM doing things like this:
https://en.wikipedia.org/wiki/List_of_volunteer_computing_pr...
But in more general sense, less like https://boinc.berkeley.edu/ and more like AWS...
It's the only way to have value, actually holding computing power in a distributed network.
There are benchmarks here - https://github.com/Eventual-Inc/Daft?tab=readme-ov-file#benc.... Seems to outperform Dask by a fair bit.
Distributed Computing related posts
- Bitcoin Block 840000
- Distributed Grok-1 (314B)
- Show HN: Achieving Consensus with Go – A Raft Implementation
- Show HN: Distributed Llama – Run LLMs on multiple devices in parallel
- Distributed Llama
- Distributed Inference and Fine-Tuning of Large Language Models over the Internet
-
A note from our sponsor - WorkOS
workos.com | 23 Apr 2024
Index
What are some of the best open-source Distributed Computing projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | ColossalAI | 37,836 |
2 | oceanbase | 7,340 |
3 | Hazelcast | 5,861 |
4 | protoactor-go | 4,862 |
5 | Akka.net | 4,612 |
6 | gleam | 3,362 |
7 | catalyst | 3,223 |
8 | alpa | 2,979 |
9 | inference | 2,424 |
10 | Graph Engine | 2,176 |
11 | boinc | 1,915 |
12 | fugue | 1,869 |
13 | Daft | 1,666 |
14 | protoactor-dotnet | 1,658 |
15 | MooseFS | 1,583 |
16 | distributed | 1,540 |
17 | hashtopolis | 1,349 |
18 | zenoh | 1,243 |
19 | vizier | 1,171 |
20 | .NET port of LMAX Disruptor | 1,160 |
21 | IdGen | 1,123 |
22 | holochain | 1,108 |
23 | SmartSql | 1,045 |