C++ Distributed Systems

Open-source C++ projects categorized as Distributed Systems

Top 12 C++ Distributed System Projects

Distributed Systems
  • xgboost

    Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

    Project mention: stackgbm VS xgboost - a user suggested alternative | libhunt.com/r/stackgbm | 2024-05-05
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • NebulaGraph Database

    A distributed, fast open-source graph database featuring horizontal scalability and high availability (by vesoft-inc)

  • pixie

    Instant Kubernetes-Native Application Observability

    Project mention: Grafana Beyla: OSS eBPF auto-instrumentation for application observability | news.ycombinator.com | 2023-09-13
  • service-fabric

    Service Fabric is a distributed systems platform for packaging, deploying, and managing stateless and stateful distributed applications and containers at large scale.

  • blazingmq

    A modern high-performance open source message queuing system

    Project mention: BlazingMQ - High performant Open Source Message Queue by Bloomberg | /r/opensource | 2023-08-02
  • curve

    Curve is a sandbox project hosted by the CNCF Foundation. It's cloud-native, high-performance, and easy to operate. Curve is an open-source distributed storage system for block and shared file storage. (by opencurve)

  • NuRaft

    C++ implementation of Raft core logic as a replication library

    Project mention: ClickHouse Keeper: A ZooKeeper alternative written in C++ | news.ycombinator.com | 2023-09-27

    That's true - C++ libraries are typically bug-ridden and require exhaustive efforts to clean up.

    But the latest bugs found by ClickHouse continuous integration system in the related library were fixed about a year ago:


  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • lizardfs

    LizardFS is an Open Source Distributed File System licensed under GPLv3.

  • v6d

    vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

  • nebula

    A distributed block-based data storage and compute engine (by varchar-io)

    Project mention: Show HN: Interactive Graph by LLM (GPT-4o) | news.ycombinator.com | 2024-05-19

    So it's a pretty simple wrapper of LLM model in use (currently gpt-4o), it does not add much technical stuff in it.

    It does not use database for any "random search", but yes, columns.ai is a data analytics tool that allows you to connect supported live data sources like Google Spreadsheet, Airtable, Notion Database to create visual stories.

    The analytics engine is home built (https://github.com/varchar-io/nebula) but it is not a database. And I don't use LLM agents, just build logic how to purify data returned by LLM, and fit them into an optimized visualization.

    Hope I answered your question!

  • ScaleStore

    This is the source code for our (Tobias Ziegler, Carsten Binnig and Viktor Leis) published paper at SIGMOD’22: ScaleStore: A Fast and Cost-Efficient Storage Engine using DRAM, NVMe, and RDMA.

    Project mention: Ask HN: Why are there no open source NVMe-native key value stores in 2023? | news.ycombinator.com | 2023-10-16

    I don't remember exactly why I have any of them saved, but these are some experimental data stores that seems to be fitting what you're looking for somewhat:

    - https://github.com/DataManagementLab/ScaleStore - "A Fast and Cost-Efficient Storage Engine using DRAM, NVMe, and RDMA"

    - https://github.com/unum-cloud/udisk - "The fastest ACID-transactional persisted Key-Value store designed for NVMe block-devices with GPU-acceleration and SPDK to bypass the Linux kernel."

    - https://github.com/capsuleman/ssd-nvme-database - "Columnar database on SSD NVMe"

  • C++ Middleware Writer

    The repo contains library code to support messaging and serialization. There are also two programs in the repo that are needed to use the CMW.

    Project mention: What led you to use Linux as your daily driver? | /r/linuxquestions | 2023-12-07

    I started with Linux in the late 90s. I switched to FreeBSD around 2013 and returned to Linux a couple of years ago. Io_uring was the main reason I had to come back. At first I ported the back tier of my code generator back to Linux and then I ported the middle tier from being POSIX based to Linux.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

C++ Distributed Systems discussion

Log in or Post with

C++ Distributed Systems related posts


What are some of the best open-source Distributed System projects in C++? This list will help you:

Project Stars
1 xgboost 25,748
2 NebulaGraph Database 10,308
3 pixie 5,358
4 service-fabric 3,013
5 blazingmq 2,492
6 curve 2,254
7 NuRaft 963
8 lizardfs 946
9 v6d 812
10 nebula 152
11 ScaleStore 111
12 C++ Middleware Writer 60

Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.