Java Distributed Systems

Open-source Java projects categorized as Distributed Systems

Top 20 Java Distributed System Projects

  • advanced-java

    😮 Core Interview Questions & Answers For Experienced Java(Backend) Developers | 互联网 Java 工程师进阶知识完全扫盲:涵盖高并发、分布式、高可用、微服务、海量数据处理等领域知识

  • Dubbo

    The java implementation of Apache Dubbo. An RPC and microservice framework.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Apache ZooKeeper

    Apache ZooKeeper

  • Project mention: On Implementation of Distributed Protocols | dev.to | 2024-04-05

    Apache ZooKeeper — a distributed coordination, synchronization, and configuration service (written in Java);

  • conductor

    Conductor is an event driven orchestration platform (by conductor-oss)

  • Project mention: Show HN: Hatchet – Open-source distributed task queue | news.ycombinator.com | 2024-03-08
  • Trino

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

  • Project mention: Trino: Fast distributed SQL query engine for big data analytics | news.ycombinator.com | 2024-03-19
  • Hazelcast

    Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

  • Project mention: Does anyone know any good java implementations for distributed key-value store? | /r/ExperiencedDevs | 2023-06-08

    You're probably looking for Hazelcast here. Note that it does much more than just a distributed k/v, but it will get you where you need to go.

  • py4j

    Py4J enables Python programs to dynamically access arbitrary Java objects

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • dslabs

    Distributed Systems Labs and Framework

  • Project mention: The leadership myth in replicated databases (2023) | news.ycombinator.com | 2024-02-06

    I recently took a Distributed Systems course, and I also thought it was very interesting and unexpected how in the most basic form of Paxos, there is no concept of node roles or hierarchy like leader/follower, master/replica, etc. The base case is that all nodes have the same replicated log, and are "writers" capable of initiating changes to the log.

    This youtube video was particularly helpful in learning about consensus algorithms, specifically Paxos/MultiPaxos:

    https://www.youtube.com/watch?v=JEpsBg0AO6o

    John Ousterhout (author of Raft) walks through Paxos/MultiPaxos as outlined by Leslie Lamport, and then talks about a series of optimizations to improve performance. One key optimization is transitioning from performing consensus on a single log-slot proposal, to the entire log-slot altogether, which mitigates failed consensus rounds, and is where concepts like leader nodes emerge. If your familiar with Raft and Paxos, while listening to these optimization applied to MultiPaxos, you can kind of notice it to begin to resemble Raft.

    The course I took was through Georgia Tech, but was largely based around a framework developed at the University of Washington called dslabs:

    https://github.com/emichael/dslabs

    It was super informative for my learning about the foundations of distributed systems, namely consensus algorithms. I'd highly recommend it for anyone interested in learning more. Although fair warning, the programming assignments were quite difficult and time consuming.

  • ScaleCube

    Microservices library - scalecube-services is a high throughput, low latency reactive microservices library built to scale. it features: API-Gateways, service-discovery, service-load-balancing, the architecture supports plug-and-play service communication modules and features. built to provide performance and low-latency real-time stream-processing

  • swim

    Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs

  • scalardb

    Universal transaction manager

  • Sparkler

    Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

  • MicroRaft

    Feature-complete implementation of the Raft consensus algorithm in Java

  • diztl

    Share, discover & download files in your network 💥

  • nosqlbench

    The open source, pluggable, nosql benchmarking suite.

  • Project mention: How a Single Line of Code Made a 24-Core Server Slower Than a Laptop | news.ycombinator.com | 2023-06-17

    Not directly related, but https://github.com/nosqlbench/nosqlbench is very flexible benchmark tool for Cassandra and other distributed systems

  • memq

    MemQ is an efficient, scalable cloud native PubSub system

  • Project mention: FLaNK Stack Weekly for 20 Nov 2023 | dev.to | 2023-11-20
  • kafka-workflow

    Simple Workflow As Code on Kafka

  • Vector-Clock

    An implementation of Vector Clock in Java :alarm_clock: (by varunu28)

  • Project mention: Vector Clocks: So what time is it? | /r/hypeurls | 2023-07-06
  • kafka-streams-demo

    Kafka streams demo

  • Multi-threaded-Dictionary-Server

    A Multi-threaded Dictionary Server that allows concurrent clients to search the meaning of a words, add new words, and remove an existing words.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Java Distributed Systems related posts

Index

What are some of the best open-source Distributed System projects in Java? This list will help you:

Project Stars
1 advanced-java 74,203
2 Dubbo 40,021
3 Apache ZooKeeper 11,925
4 conductor 10,230
5 Trino 9,552
6 Hazelcast 5,861
7 py4j 1,144
8 dslabs 1,109
9 ScaleCube 601
10 swim 473
11 scalardb 438
12 Sparkler 409
13 MicroRaft 190
14 diztl 172
15 nosqlbench 159
16 memq 111
17 kafka-workflow 10
18 Vector-Clock 10
19 kafka-streams-demo 3
20 Multi-threaded-Dictionary-Server 3

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com