SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 C++ Parallel Projects
-
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Project mention: 《LightGBM: 一种高效的梯度提升决策树算法》论文(A Highly Efficient Gradient Boosting Decision Tree) | dev.to | 2025-10-15代码:https://github.com/microsoft/LightGBM
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
- https://youtu.be/D4Mn_5MdnDc
If you need to scale data access with whatever tool, it might be worth a look. XRootD is most often used with ROOT (https://root.cern), but can be used more generically as well. It powers CERN's EOS distributed filesystem, for example, where CERN stores all physics data from the major experiments at the LHC.
*disclaimer, I'm the release manager for XRootD and work in the storage and data management group at CERN.
-
parallel-hashmap
A family of header-only, very fast and memory-friendly hashmap and btree containers.
-
thread-pool
BS::thread_pool: a fast, lightweight, modern, and easy-to-use C++17 / C++20 / C++23 thread pool library
-
-
-
-
-
libfork
A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines
-
-
Project mention: Show HN: Utl:parallel – Work-stealing concurrency library for C++17 | news.ycombinator.com | 2025-08-03
-
-
elbencho
A distributed storage benchmark for file systems, object stores & block devices with support for GPUs
-
-
-
-
-
-
Project mention: # BioDynamo v2 Brings GPU Acceleration to Large-Scale Cell Simulations | dev.to | 2025-07-02
-
ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
I was asked this a few months back but don’t have the measurements fresh anymore. In general, I think TBB is one of the more thorough and feature-rich parallelism libraries out there. That said, I just found a comparable usage example in my benchmarks, and it doesn’t look like TBB will have the same low-latency profile as Fork Union: https://github.com/ashvardanian/ParallelReductionsBenchmark/...
-
-
C++ Parallel discussion
C++ Parallel related posts
-
Vc: SIMD Vector Classes for C++
-
Parsync, a tool for parallel SSH transfers – 7x faster than rsync
-
Show HN: Utl:parallel – Work-stealing concurrency library for C++17
-
Show HN: Yet another JSON lib (C++17)
-
Understanding SIMD: Infinite Complexity of Trivial Problems
-
You can use C-Reduce for any language
-
Automated Test-Case Reduction
-
A note from our sponsor - SaaSHub
www.saashub.com | 5 Jun 2026
Index
What are some of the best open-source Parallel projects in C++? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | LightGBM | 18,425 |
| 2 | Taskflow | 11,986 |
| 3 | root | 3,217 |
| 4 | parallel-hashmap | 3,186 |
| 5 | thread-pool | 2,997 |
| 6 | moose | 2,256 |
| 7 | HiGHS | 1,645 |
| 8 | Vc | 1,516 |
| 9 | RaftLib | 995 |
| 10 | libfork | 877 |
| 11 | libgrape-lite | 441 |
| 12 | UTL | 334 |
| 13 | YACLib | 293 |
| 14 | elbencho | 269 |
| 15 | rangeless | 201 |
| 16 | rocPRIM | 176 |
| 17 | ips4o | 167 |
| 18 | fast-cma-es | 162 |
| 19 | firebuild | 150 |
| 20 | biodynamo | 133 |
| 21 | ParallelReductionsBenchmark | 118 |
| 22 | charly-vm | 100 |
| 23 | librapidarchive | 86 |