Are you a developer or a data analyst? Share your thoughts about your coding tools in our short survey and get a chance to win prizes! Learn more →
Top 23 C++ parallel-computing Projects
-
Project mention: Show HN: Coros – A Modern C++ Library for Task Parallelism | news.ycombinator.com | 2024-09-25
Martin, have you had a look at https://github.com/taskflow/taskflow ?
-
JetBrains
Tell us how you use coding tools. You may win a prize! Are you a developer or a data analyst? Share your thoughts about your coding tools in our short survey and get a chance to win prizes!
-
Thanks for the added context on the builds! As "foreign" BW player and fellow speech processing researcher, I agree shallow contextual biasing should help. While not difficult to implement, most generally available ASR solutions don't make it easy to use. There's a PR in ctranslate2 implementing the same feature so that it could be exposed in faster-whisper: https://github.com/OpenNMT/CTranslate2/pull/1789
-
kokkos
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
-
-
Project mention: Delivering the Missing Building Blocks for Nvidia CUDA Kernel Fusion in Python | news.ycombinator.com | 2025-07-16
There’s an extensive change-log supporting the CCCL 3.0 release on GitHub from 3 hours ago: https://github.com/NVIDIA/cccl/releases/tag/v3.0.0
-
Project mention: Understanding SIMD: Infinite Complexity of Trivial Problems | news.ycombinator.com | 2024-11-30
I'm surprised no one has mentioned Vc. I found ispc clunky and not as performant, and std::simd didn't support some useful math ops like rsqrt. Vc has been around for years, I have no trouble including it in my codes, it has masking and many of the most useful math ops, and I can get over 1 TF/s on a consumer-grade Ryzen and at least 3 TF/s on the big Epyc CPUs.
https://github.com/VcDevel/Vc
-
Kratos
Kratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface. (by KratosMultiphysics)
-
Sevalla
Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
-
-
libfork
A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines
-
-
-
-
coros
An easy-to-use and fast library for task-based parallelism, utilizing coroutines. (by mtmucha)
Project mention: Show HN: Coros – A Modern C++ Library for Task Parallelism | news.ycombinator.com | 2024-09-25In your dequeue/circular buffer implementation, how is it able to grow the queue without locking?
The code seems to rely on atomics for head & tail, but grows the queue without any special provisions I can see.
https://github.com/mtmucha/coros/blob/ee30d3c1d0602c3071aa26...
-
-
-
areg-sdk
Remote objects act local. C++ Async RPC for threads and processes with auto-discovery and zero-boilerplate messaging.
Long time I was not here and I am very sure that none missed me :) I was busy developing new features of areg skd and creating user interface tool for areg sdk. And I must say, that working almost alone for more these 2 project, I made a huge progress. And what is very important, I have fun.
-
-
-
ConcurrentDeque
Fast, generalized, implementation of the Chase-Lev lock-free work-stealing deque for C++17
-
-
Lazy
Light-weight header-only library for parallel function calls and continuations in C++ based on Eric Niebler's talk at CppCon 2019.
-
-
parallel-dfs-dag
A parallel implementation of DFS for Directed Acyclic Graphs (https://research.nvidia.com/publication/parallel-depth-first-search-directed-acyclic-graphs)
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
C++ parallel-computing discussion
C++ parallel-computing related posts
-
Show HN: Coros – A Modern C++ Library for Task Parallelism
-
rodin alternatives - mfem and FreeFem-sources
7 projects | 8 Mar 2023 -
Learn PDE constrained optimization
-
Open source FEA tools instead of ANSYS Workbench and APDL
-
Eighty Years of the Finite Element Method: Birth, Evolution, and Future
-
Fortran on GPU
-
Best Python package(s) to solve PDEs numerically?
-
A note from our sponsor - JetBrains
surveys.jetbrains.com | 2 Sep 2025
Index
What are some of the best open-source parallel-computing projects in C++? This list will help you:
# | Project | Stars |
---|---|---|
1 | Taskflow | 11,183 |
2 | CTranslate2 | 3,992 |
3 | kokkos | 2,299 |
4 | mfem | 1,956 |
5 | cccl | 1,885 |
6 | Vc | 1,491 |
7 | Kratos | 1,162 |
8 | dolfinx | 951 |
9 | libfork | 718 |
10 | oneMath | 705 |
11 | RAJA | 533 |
12 | parlaylib | 371 |
13 | coros | 328 |
14 | feelpp | 324 |
15 | PothosCore | 313 |
16 | areg-sdk | 301 |
17 | CPURasterizer | 192 |
18 | axom | 173 |
19 | ConcurrentDeque | 146 |
20 | cppRouting | 116 |
21 | Lazy | 112 |
22 | Bulk | 94 |
23 | parallel-dfs-dag | 50 |