Java Distributed Systems

Open-source Java projects categorized as Distributed Systems

Top 20 Java Distributed System Projects

  • advanced-java

    😮 Core Interview Questions & Answers For Experienced Java(Backend) Developers | 互联网 Java 工程师进阶知识完全扫盲:涵盖高并发、分布式、高可用、微服务、海量数据处理等领域知识

  • Dubbo

    The java implementation of Apache Dubbo. An RPC and microservice framework.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • Apache ZooKeeper

    Apache ZooKeeper

    Project mention: Easy Guide to Integrating Kafka: Practical Solutions for Managing Blob Data | dev.to | 2024-01-22

    To use Kafka, we also need to deploy a service that keeps configuration informations such as Zookeeper.

  • Trino

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

    Project mention: Game analytic power: how we process more than 1 billion events per day | dev.to | 2023-11-24

    We decided not to waste time reinventing the wheel and simply installed Trino on our servers. It’s a full featured SQL query engine that works on your data. Now our analysts can use it to work with data from AppMetr and execute queries at different levels of complexity.

  • Hazelcast

    Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

    Project mention: Does anyone know any good java implementations for distributed key-value store? | /r/ExperiencedDevs | 2023-06-08

    You're probably looking for Hazelcast here. Note that it does much more than just a distributed k/v, but it will get you where you need to go.

  • conductor

    Conductor is a microservices orchestration engine. (by conductor-oss)

    Project mention: Open Source Workflow Orchestration Platform | news.ycombinator.com | 2024-02-08
  • py4j

    Py4J enables Python programs to dynamically access arbitrary Java objects

  • WorkOS

    The modern API for authentication & user identity. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • dslabs

    Distributed Systems Labs and Framework

    Project mention: The leadership myth in replicated databases (2023) | news.ycombinator.com | 2024-02-06

    I recently took a Distributed Systems course, and I also thought it was very interesting and unexpected how in the most basic form of Paxos, there is no concept of node roles or hierarchy like leader/follower, master/replica, etc. The base case is that all nodes have the same replicated log, and are "writers" capable of initiating changes to the log.

    This youtube video was particularly helpful in learning about consensus algorithms, specifically Paxos/MultiPaxos:

    https://www.youtube.com/watch?v=JEpsBg0AO6o

    John Ousterhout (author of Raft) walks through Paxos/MultiPaxos as outlined by Leslie Lamport, and then talks about a series of optimizations to improve performance. One key optimization is transitioning from performing consensus on a single log-slot proposal, to the entire log-slot altogether, which mitigates failed consensus rounds, and is where concepts like leader nodes emerge. If your familiar with Raft and Paxos, while listening to these optimization applied to MultiPaxos, you can kind of notice it to begin to resemble Raft.

    The course I took was through Georgia Tech, but was largely based around a framework developed at the University of Washington called dslabs:

    https://github.com/emichael/dslabs

    It was super informative for my learning about the foundations of distributed systems, namely consensus algorithms. I'd highly recommend it for anyone interested in learning more. Although fair warning, the programming assignments were quite difficult and time consuming.

  • ScaleCube

    Microservices library - scalecube-services is a high throughput, low latency reactive microservices library built to scale. it features: API-Gateways, service-discovery, service-load-balancing, the architecture supports plug-and-play service communication modules and features. built to provide performance and low-latency real-time stream-processing

  • swim

    Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs

  • scalardb

    Universal transaction manager

  • Sparkler

    Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

  • MicroRaft

    Feature-complete implementation of the Raft consensus algorithm in Java

  • diztl

    Share, discover & download files in your network 💥

  • nosqlbench

    The open source, pluggable, nosql benchmarking suite.

    Project mention: How a Single Line of Code Made a 24-Core Server Slower Than a Laptop | news.ycombinator.com | 2023-06-17

    Not directly related, but https://github.com/nosqlbench/nosqlbench is very flexible benchmark tool for Cassandra and other distributed systems

  • memq

    MemQ is an efficient, scalable cloud native PubSub system

    Project mention: FLaNK Stack Weekly for 20 Nov 2023 | dev.to | 2023-11-20
  • kafka-workflow

    Simple Workflow As Code on Kafka

  • Vector-Clock

    An implementation of Vector Clock in Java :alarm_clock: (by varunu28)

    Project mention: Vector Clocks: So what time is it? | /r/hypeurls | 2023-07-06
  • kafka-streams-demo

    Kafka streams demo

  • Multi-threaded-Dictionary-Server

    A Multi-threaded Dictionary Server that allows concurrent clients to search the meaning of a words, add new words, and remove an existing words.

  • LearnThisRepo.com

    Learn 300+ open source libraries for free using AI. LearnThisRepo lets you learn 300+ open source repos including Postgres, Langchain, VS Code, and more by chatting with them using AI!

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-08.

Java Distributed Systems related posts

Index

What are some of the best open-source Distributed System projects in Java? This list will help you:

Project Stars
1 advanced-java 73,356
2 Dubbo 39,801
3 Apache ZooKeeper 11,808
4 Trino 9,229
5 Hazelcast 5,766
6 conductor 3,838
7 py4j 1,136
8 dslabs 1,070
9 ScaleCube 597
10 swim 471
11 scalardb 435
12 Sparkler 409
13 MicroRaft 179
14 diztl 174
15 nosqlbench 156
16 memq 106
17 kafka-workflow 10
18 Vector-Clock 8
19 kafka-streams-demo 3
20 Multi-threaded-Dictionary-Server 3
Learn 300+ open source libraries for free using AI.
LearnThisRepo lets you learn 300+ open source repos including Postgres, Langchain, VS Code, and more by chatting with them using AI!
learnthisrepo.com