Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 20 Java Distributed System Projects
-
advanced-java
😮 Core Interview Questions & Answers For Experienced Java(Backend) Developers | 互联网 Java 工程师进阶知识完全扫盲:涵盖高并发、分布式、高可用、微服务、海量数据处理等领域知识
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
-
Hazelcast
Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
ScaleCube
Microservices library - scalecube-services is a high throughput, low latency reactive microservices library built to scale. it features: API-Gateways, service-discovery, service-load-balancing, the architecture supports plug-and-play service communication modules and features. built to provide performance and low-latency real-time stream-processing
-
swim
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
-
Multi-threaded-Dictionary-Server
A Multi-threaded Dictionary Server that allows concurrent clients to search the meaning of a words, add new words, and remove an existing words.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Apache ZooKeeper — a distributed coordination, synchronization, and configuration service (written in Java);
Project mention: Show HN: Hatchet – Open-source distributed task queue | news.ycombinator.com | 2024-03-08
Project mention: Trino: Fast distributed SQL query engine for big data analytics | news.ycombinator.com | 2024-03-19
Project mention: Does anyone know any good java implementations for distributed key-value store? | /r/ExperiencedDevs | 2023-06-08You're probably looking for Hazelcast here. Note that it does much more than just a distributed k/v, but it will get you where you need to go.
Project mention: The leadership myth in replicated databases (2023) | news.ycombinator.com | 2024-02-06I recently took a Distributed Systems course, and I also thought it was very interesting and unexpected how in the most basic form of Paxos, there is no concept of node roles or hierarchy like leader/follower, master/replica, etc. The base case is that all nodes have the same replicated log, and are "writers" capable of initiating changes to the log.
This youtube video was particularly helpful in learning about consensus algorithms, specifically Paxos/MultiPaxos:
https://www.youtube.com/watch?v=JEpsBg0AO6o
John Ousterhout (author of Raft) walks through Paxos/MultiPaxos as outlined by Leslie Lamport, and then talks about a series of optimizations to improve performance. One key optimization is transitioning from performing consensus on a single log-slot proposal, to the entire log-slot altogether, which mitigates failed consensus rounds, and is where concepts like leader nodes emerge. If your familiar with Raft and Paxos, while listening to these optimization applied to MultiPaxos, you can kind of notice it to begin to resemble Raft.
The course I took was through Georgia Tech, but was largely based around a framework developed at the University of Washington called dslabs:
https://github.com/emichael/dslabs
It was super informative for my learning about the foundations of distributed systems, namely consensus algorithms. I'd highly recommend it for anyone interested in learning more. Although fair warning, the programming assignments were quite difficult and time consuming.
Project mention: How a Single Line of Code Made a 24-Core Server Slower Than a Laptop | news.ycombinator.com | 2023-06-17Not directly related, but https://github.com/nosqlbench/nosqlbench is very flexible benchmark tool for Cassandra and other distributed systems
Java Distributed Systems related posts
- Open Source Workflow Orchestration Platform
- The leadership myth in replicated databases (2023)
- Opsgenie Alerting with Orkes Conductor
- Conductor Open Source
- Netflix Conductor OSS discontinued support
- Show HN: Advent of Distributed Systems
- Orkes Monthly Highlights - October 2023
-
A note from our sponsor - InfluxDB
www.influxdata.com | 25 Apr 2024
Index
What are some of the best open-source Distributed System projects in Java? This list will help you:
Project | Stars | |
---|---|---|
1 | advanced-java | 74,203 |
2 | Dubbo | 40,021 |
3 | Apache ZooKeeper | 11,925 |
4 | conductor | 10,230 |
5 | Trino | 9,552 |
6 | Hazelcast | 5,861 |
7 | py4j | 1,144 |
8 | dslabs | 1,109 |
9 | ScaleCube | 601 |
10 | swim | 473 |
11 | scalardb | 438 |
12 | Sparkler | 409 |
13 | MicroRaft | 190 |
14 | diztl | 172 |
15 | nosqlbench | 159 |
16 | memq | 111 |
17 | kafka-workflow | 10 |
18 | Vector-Clock | 10 |
19 | kafka-streams-demo | 3 |
20 | Multi-threaded-Dictionary-Server | 3 |
Sponsored