Top 11 C++ Distributed Projects
An Open Source Machine Learning Framework for EveryoneProject mention: Top Github repo trends in 2021 | dev.to | 2022-01-12
No surprises here: deep learning is the most popular subcategory, with hugging face transformers repo, YOLOv5, Tensorflow and Deepmind’s Alphafold all in the mix. Surprisingly, the only proper infrastructure-ey repos on the list are Meilisearch and Clickhouse, a tad bit surprising given all the hype data infrastructure receives in VC-world, but again, probably just a question of size of end-user populations + whether data scientists spend tons of time on Github vs. Web Developers…
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit (by microsoft)
Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.Project mention: Writing the fastest GBDT libary in Rust | dev.to | 2022-01-11
Here are our benchmarks on training time comparing Tangram's Gradient Boosted Decision Tree Library to LightGBM, XGBoost, CatBoost, and sklearn.
A distributed, fast open-source graph database featuring horizontal scalability and high availability (by vesoft-inc)Project mention: Embeddable graph database | reddit.com/r/golang | 2022-01-03
One option is NebulaGraph if your looking for low latency, scalability, and HA.
qTox is a chat, voice, video, and file transfer IM client using the encrypted peer-to-peer Tox protocol.Project mention: Barroso não sabe o que é VPN, e se soubesse, rasgaria o boga de tanta raiva. 😂 | reddit.com/r/brasilivre | 2022-01-10
OneFlow is a performance-centered and open-source deep learning framework.Project mention: Optimization of CUDA Elementwise Template Library: Practical, Efficient, and Extensible | reddit.com/r/CUDA | 2022-01-20
Elementwise operation refers to applying a function transformation to every element of a tensor. In deep learning, many operators can be regraded as elementwise operators, such as common activation functions (like ReLU and GELU) and ScalarMultiply (multiplying each element of a tensor by a scalar). For this elementwise operation, OneFlow(https://github.com/Oneflow-Inc/oneflow/) abstracts a CUDA template. this article will introduce the design thoughts and optimization techniques of CUDA template.
The C++ Standard Library for Parallelism and ConcurrencyProject mention: Would anyone be interested in an HPC coroutine library for MPI? | reddit.com/r/cpp | 2021-10-07
We're working on something similar, but based on sender/receiver in HPX (a lightweight threading runtime) and DLA-Future (distributed linear algebra currently based on (HPX) futures; based on sender/receiver in the future). With senders-as-awaitables this would also get you coroutine support for asynchronous MPI calls for free. We don't have that yet, but it's planned. In the meantime libunifex should be able to fill in the gaps.
OPS - Build and Run Open Source Unikernels. Quickly and easily build and deploy open source unikernels in tens of seconds. Deploy in any language to any cloud.
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF)Project mention: [P] Bridging Dask and Tensorflow for distributed machine learniing with Vineyard | reddit.com/r/MachineLearning | 2021-09-08
We propose vineyard, https://github.com/v6d-io/v6d to address such challenges, which, provides efficient zero-copy data sharing between different compute engines, without extra cost of copying and serialization, compared other similar solutions.
A distributed storage benchmark for file systems, object stores & block devices with support for GPUsProject mention: [HELP] Nvidia GPUDirect storage benchmark for an AI400 system | reddit.com/r/HPC | 2021-11-05
You can also use elbencho (https://github.com/breuner/elbencho) which is functionally equivalent to IOR but a little more flexible.
The most efficient interface-centric real-time communication engine to connect Things at the edge of the network.Project mention: Multithreading, multiprocessing or single thread application switch within a minute with homogeneous design and implementation, and other examples. | reddit.com/r/programming | 2021-11-09
Storage server for Oxen Service NodesProject mention: Weekly Dev Update 06/07/2021 | reddit.com/r/oxen_io | 2021-07-06
[Storage Server] Testing, bug fixes, and miscellaneous updates for the big 2.2.0 update https://github.com/oxen-io/oxen-storage-server/pull/433
C++ Distributed related posts
Optimization of CUDA Elementwise Template Library: Practical, Efficient, and Extensible
1 project | reddit.com/r/CUDA | 20 Jan 2022
Pytorch Distributed Parallel Computing or Hpc Research
1 project | reddit.com/r/deeplearning | 16 Jan 2022
How to Implement an Efficient LayerNorm CUDA Kernel[R]
1 project | reddit.com/r/MachineLearning | 7 Jan 2022
What an Optimal Point-to-Point Communication Library Should Be?
1 project | reddit.com/r/CUDA | 6 Jan 2022
Embeddable graph database
1 project | reddit.com/r/golang | 3 Jan 2022
How to Go Beyond Data Parallelism and Model Parallelism: Starting from GShard
1 project | news.ycombinator.com | 7 Dec 2021
How to Implement an Efficient Softmax CUDA Kernel
1 project | news.ycombinator.com | 6 Dec 2021
What are some of the best open-source Distributed projects in C++? This list will help you:
Are you hiring? Post a new remote job listing for free.