Top 23 Java Distributed System Projects
-
advanced-java
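😮 Core Interview Questions & Answers For Experienced Java(Backend) Developers | A complete primer on advanced knowledge for internet Java engineers, covering high concurrency, distributed systems, high availability, microservices, massive data processing, and related fields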
-
Dubbo
Let's look at the example from Apache Dubbo:
-
awesome-system-design-resources
Learn System Design concepts and prepare for interviews using free resources.
Project mention: 🔥 17 Best Free GitHub Repositories to Crack System Design Interviews 🛠️ | dev.to | 2024-12-06
11. Awesome System Design Resources
-
conductor
Conductor is an event-driven orchestration platform providing a durable and highly resilient execution engine for your applications
Project mention: Netflix has open-sourced its Maestro Workflow Orchestrator | news.ycombinator.com | 2024-07-22
I'm a bit confused about what is going on here: This project appears to use Netflix/conductor [0]. But you go to that repo, you see it has been archived, with a message saying it is replaced by Netflix's internal non-OSS version, and by unmentioned community forks – by which I assume they mean Orkes Conductor [1]. But this isn't using Orkes Conductor, it looks like it is using the discontinued Netflix version `com.netflix.conductor:conductor-core:2.31.5` [2] – and an outdated version of it too.
[0] https://github.com/Netflix/conductor
[1] https://github.com/conductor-oss/conductor
[2] https://github.com/Netflix/maestro/blob/e8bee3f1625d3f31d84d...
-
Apache ZooKeeper
ZooKeeper is a distributed coordination service used in older versions of Kafka to manage cluster metadata, leader election, and configuration. It ensures consistency and synchronization across Kafka brokers.
-
Trino
Project mention: Twitter's 600-Tweet Daily Limit Crisis: Soaring GCP Costs and the Open Source Fix Elon Musk Ignored | dev.to | 2025-04-10
Trino: Trino (formerly known as PrestoSQL) is a high-performance distributed SQL query engine designed for data analysis. It offers efficient querying capabilities across multiple data sources, including various file formats, databases, and data lakes. Some background on Trino and Presto: Presto was the original name of the project, and it was developed at Facebook. The original creators and a significant portion of the community forked the project as PrestoSQL in 2019, and in December 2020 the fork was renamed Trino. Read more here: Trino Blog.
-
Hazelcast
Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.
-
bookkeeper
Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads
-
dslabs
You'd define invariants that must be met. This has been done before.
https://en.wikipedia.org/wiki/Search-based_software_engineer...
e.g. testing implementations of Paxos: https://github.com/emichael/dslabs
-
py4j
Project mention: Py4J: Enables Python programs to dynamically access arbitrary Java objects | news.ycombinator.com | 2025-04-12
-
ScaleCube
Microservices library - scalecube-services is a high-throughput, low-latency reactive microservices library built to scale. It features API gateways, service discovery, and service load balancing, and its architecture supports plug-and-play service communication modules and features. It is built to provide performance and low-latency real-time stream processing.
-
scalardb
-
swim
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
-
Sparkler
-
kronotop
Project mention: Kronotop: Redis-compatible, transactional document store backed by FoundationDB | news.ycombinator.com | 2025-01-20
-
MicroRaft
-
pegasus
Pegasus Workflow Management System - Automate, recover, and debug scientific computations. (by pegasus-isi)
-
diztl
-
nosqlbench
-
memq
-
kafka-delayed-queue
-
Vector-Clock
-
kafka-workflow
Java Distributed Systems related posts
-
Py4J: Enables Python programs to dynamically access arbitrary Java objects
-
System Design Resources
-
Dubbo: A Robust Java RPC and Microservice Framework
-
Maestro: Netflix's Workflow Orchestrator
-
Conductor – open-source event driven orchestration platform
-
RAG Explained | Using Retrieval-Augmented Generation to Build Semantic Search
-
Emerging Tech Trends 2024: The Latest Developments in AI, API, and Automation
-
Index
What are some of the best open-source Distributed System projects in Java? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | advanced-java | 77,469 |
| 2 | Dubbo | 40,904 |
| 3 | awesome-system-design-resources | 22,308 |
| 4 | conductor | 20,409 |
| 5 | Apache ZooKeeper | 12,451 |
| 6 | Trino | 11,124 |
| 7 | Hazelcast | 6,298 |
| 8 | bookkeeper | 1,922 |
| 9 | dslabs | 1,384 |
| 10 | py4j | 1,228 |
| 11 | ScaleCube | 622 |
| 12 | scalardb | 510 |
| 13 | swim | 493 |
| 14 | Sparkler | 412 |
| 15 | kronotop | 248 |
| 16 | MicroRaft | 232 |
| 17 | pegasus | 189 |
| 18 | diztl | 177 |
| 19 | nosqlbench | 176 |
| 20 | memq | 136 |
| 21 | kafka-delayed-queue | 39 |
| 22 | Vector-Clock | 11 |
| 23 | kafka-workflow | 9 |