Top 12 C++ Distributed Projects
-
Project mention: Worldwide building footprints derived from satellite imagery from Microsoft | reddit.com/r/gis | 2022-05-20
-
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Project mention: Use whatever is best for the problem, but still | reddit.com/r/datascience | 2022-08-09
LGBM doesn't do RF well, but it's easy to manually bag single LGBM trees.
-
NebulaGraph Database
A distributed, fast open-source graph database featuring horizontal scalability and high availability (by vesoft-inc)
A NoSQL graph database is a non-relational, distributed database that uses a graph model. NoSQL stands for “Not only SQL” and refers to databases that differ from traditional relational databases in their data model and performance characteristics. Graph databases are especially useful for data defined by relationships, from friendships on social networks to equipment supply chains and business processes. They can quickly traverse large numbers of linked data points to uncover hidden connections between entities, which makes them well suited to network analysis at scale, for example financial fraud detection and recommendation engines.
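To make the idea of traversing linked data points concrete, here is a minimal C++ sketch of a breadth-first search that finds the shortest chain of links between two entities. It is not NebulaGraph code and does not use its API: the `Graph` alias, the `shortest_path` and `sample_graph` functions, and the account names are all hypothetical and exist only for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <queue>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical in-memory graph: each vertex maps to the vertices it links to.
using Graph = std::unordered_map<std::string, std::vector<std::string>>;

// Breadth-first search: returns the shortest chain of links from `from` to `to`,
// or an empty vector when the two vertices are not connected.
std::vector<std::string> shortest_path(const Graph& g,
                                       const std::string& from,
                                       const std::string& to) {
    std::unordered_map<std::string, std::string> parent;  // vertex -> predecessor
    std::queue<std::string> frontier;
    parent[from] = from;
    frontier.push(from);

    while (!frontier.empty()) {
        std::string current = frontier.front();
        frontier.pop();
        if (current == to) break;
        auto it = g.find(current);
        if (it == g.end()) continue;  // vertex has no outgoing edges
        for (const auto& next : it->second) {
            // emplace succeeds only the first time a vertex is seen
            if (parent.emplace(next, current).second) {
                frontier.push(next);
            }
        }
    }

    if (parent.find(to) == parent.end()) return {};

    // Walk the predecessor links back from `to`, then reverse.
    std::vector<std::string> path;
    for (std::string v = to; v != from; v = parent.at(v)) path.push_back(v);
    path.push_back(from);
    std::reverse(path.begin(), path.end());
    assert(path.front() == from && path.back() == to);
    return path;
}

// Made-up sample data: who is linked to whom.
Graph sample_graph() {
    return {
        {"alice", {"bob", "carol"}},
        {"bob", {"dave"}},
        {"carol", {"dave"}},
        {"dave", {"mallory"}},
    };
}
```

Calling `shortest_path(sample_graph(), "alice", "mallory")` returns a four-vertex chain that passes through `dave`. A graph database runs this kind of traversal against stored, indexed data rather than an in-memory map, but the access pattern is the same: follow edges outward from a starting vertex.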
-
oceanbase
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
Project mention: Oceanbase: Enterprise scalable distributed highly available relational database | news.ycombinator.com | 2023-03-21
-
Project mention: OneFlow v0.9.0 Came Out!——A Distributed Deep Learning Framework | reddit.com/r/programming | 2023-02-12
-
https://github.com/STEllAR-GROUP/hpx Welcoming community + we have been part of GSoC for 4-5 years now so feel free to apply there when it opens ;)
-
incubator-kvrocks
Kvrocks is a distributed key-value NoSQL database that uses RocksDB as its storage engine and is compatible with the Redis protocol.
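Compatibility with the Redis protocol means that a client frames its requests in RESP, the Redis serialization protocol. As a rough illustration of that wire format, the C++ sketch below encodes a command as a RESP array of bulk strings. It is not taken from Kvrocks, and the function name `encode_resp_command` is an assumption made for this example.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Encode a command as a RESP array of bulk strings:
// "*<argc>\r\n" followed by "$<len>\r\n<bytes>\r\n" for each argument.
std::string encode_resp_command(const std::vector<std::string>& args) {
    assert(!args.empty());  // a command needs at least its name
    std::string out = "*" + std::to_string(args.size()) + "\r\n";
    for (const auto& arg : args) {
        out += "$" + std::to_string(arg.size()) + "\r\n";
        out += arg;
        out += "\r\n";
    }
    return out;
}
```

For example, `encode_resp_command({"SET", "key", "value"})` produces `*3\r\n$3\r\nSET\r\n$3\r\nkey\r\n$5\r\nvalue\r\n`. In practice an existing Redis client library handles this framing, so application code rarely builds it by hand.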
-
Brief Overview for any interested: Vineyard (v6d) is an in-memory immutable data manager that provides out-of-the-box high-level abstraction and zero-copy in-memory sharing for distributed data in big data tasks, such as graph analytics (e.g., GraphScope), numerical computing (e.g., Mars), and machine learning.
-
elbencho
A distributed storage benchmark for file systems, object stores & block devices with support for GPUs
-
areg-sdk
Interface-centric, cross-platform communication engine that simplifies multiprocessing and multithreading application development.
My closer acquaintance with these two tools started with the open-source areg-sdk project. On the recommendation of an experienced friend, I started with make. The main reason was that it is more widespread in embedded development; they had no other weighty argument. After setting up make, I decided to integrate CMake as well, and I quickly found that CMake is more understandable and more powerful for me. It already has many features that make things easier. The biggest advantage for me is that, compared to make and Makefiles, many IDEs support building with CMake. Cross-platform and cross-compile support is an important feature for areg-sdk, and it is easier to achieve with CMake. For example, with make I can compile under Linux with gcc/clang, but under Windows I can compile only in a Cygwin environment and not with MSVC, which is not an issue with CMake.
-
C++ Distributed related posts
- Oceanbase: Enterprise scalable distributed highly available relational database
- OneFlow v0.9.0 Came Out!——A Distributed Deep Learning Framework
- OneFlow v0.9.0 Came Out!
- What are some C++ projects with high quality code that I can read through?
- What is a NoSQL Graph Database?
- Graph Database vs Relational Database: What to Choose?
- Using NebulaGraph Importer to Import Data into NebulaGraph Database
Index
What are some of the best open-source Distributed projects in C++? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | tensorflow | 172,321 |
2 | CNTK | 17,337 |
3 | LightGBM | 14,777 |
4 | NebulaGraph Database | 8,891 |
5 | oceanbase | 5,395 |
6 | oneflow | 4,623 |
7 | HPX | 2,123 |
8 | incubator-kvrocks | 1,995 |
9 | v6d | 692 |
10 | elbencho | 87 |
11 | areg-sdk | 85 |
12 | oxen-storage-server | 22 |