C++ HPC

Open-source C++ projects categorized as HPC

High Performance Computing. Scientists, engineers, and researchers use high performance computing (HPC) for computationally intensive tasks such as weather forecasting, oil and gas exploration, physics, and quantum mechanics.

While HPC workloads can run on a single system, the real power comes from connecting multiple HPC nodes into a cluster, which may also be called a supercomputer. A cluster makes it possible to run extreme-scale simulations that would be impractical or impossible on a single system.

Top 23 C++ HPC Projects

  1. ArrayFire

    ArrayFire: a general purpose GPU library.

  3. FluidX3D

    The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

    Project mention: FluidX3D | news.ycombinator.com | 2024-12-07
  4. mfem

    Lightweight, general, scalable C++ library for finite element methods

  5. less_slow.cpp

    Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

    Project mention: Faster sorting with SIMD CUDA intrinsics | news.ycombinator.com | 2025-05-06

    Thanks, appreciate the gesture :)

    Traditional SWAR on GPUs is a fascinating topic. I've begun assembling a set of synthetic benchmarks to compare DP4A vs. DPX (<https://github.com/ashvardanian/less_slow.cpp/pull/35>), but it feels incomplete without SWAR. My working hypothesis is that 64-bit SWAR on properly aligned data could be very useful in GPGPU, though FMA/MIN/MAX operations in that PR might not be the clearest showcase of its strengths. Do you have a better example or use case in mind?

  6. VkFFT

    Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library

  7. cccl

    CUDA Core Compute Libraries

    Project mention: Learning Assembly for Fun, Performance and Profit | news.ycombinator.com | 2025-04-12

    So I would say skill at GPU assembly is in-demand for the elite tier of GPU performance work. Not necessarily writing much of it (though see [1] for an example, this is the kernel of multisplit as used in Nvidia's Onesweep implementation), but definitely in being able to read it so you can understand what the compiled code is actually doing. I'll also cite as evidence of that the incredible work of the engineers on Nanite. They describe writing the core of the microtriangle software renderer in HLSL but analyzing the assembler output to optimize down to the cycle level, as described in their "deep dive into Nanite virtualized geometry" talk (timestamp points to the reference to instruction-level micro-optimization).

    [1]: https://github.com/NVIDIA/cccl/blob/2d1fa6bc9235106740d9373c...

    [2]: https://www.youtube.com/watch?v=eviSykqSUUw&t=2073s

  8. AdaptiveCpp

    Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!

    Project mention: AdaptiveCpp – Implementation of SYCL and C++ Parallelism for CPUs and GPUs | news.ycombinator.com | 2025-01-02
  10. Boost.Compute

    A C++ GPU Computing Library for OpenCL

  11. MatX

    An efficient C++17 GPU numerical computing library with Python-like syntax

  12. Trilinos

    Primary repository for the Trilinos Project

  13. eve

    Expressive Vector Engine - SIMD in C++ Goes Brrrr (by jfalcou)

    Project mention: Expressive Vector Engine – SIMD in C++ | news.ycombinator.com | 2025-01-05

    Here is a bunch of simple examples: https://github.com/jfalcou/eve/blob/fb093a0553d25bb8114f1396...

    I personally think we have the following strengths:

    * Algorithms. Writing SIMD loops is very hard. We give you a lot of ready to go loops. (find, search, remove, set_intersection to name a few).

  14. RaftLib

    The RaftLib C++ library, streaming/dataflow concurrency via C++ iostream-like operators

  15. Fastor

    A lightweight high performance tensor algebra framework for modern C++

  16. oneMath

    oneAPI Math Library (oneMath)

  17. relion

    Image-processing software for cryo-electron microscopy

  18. ginkgo

    Numerical linear algebra software package (by ginkgo-project)

  19. occa

    Portable and vendor neutral framework for parallel programming on heterogeneous platforms.

  20. blitz

    Blitz++ Multi-Dimensional Array Library for C++ (by blitzpp)

  21. alpaka

    Abstraction Library for Parallel Kernel Acceleration :llama: (by alpaka-group)

  22. Umpire

    An application-focused API for memory management on NUMA & GPU architectures (by LLNL)

  23. BabelStream

    STREAM, for lots of devices written in many programming models

  24. qmcpack

    Main repository for QMCPACK, an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids with full performance portable GPU support

  25. nekRS

    our next generation fast and scalable CFD code

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).
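
The less_slow.cpp mention above raises 64-bit SWAR (SIMD within a register) as a possible technique. As a rough illustration of the idea only, here is a minimal CPU-side sketch in standard C++ that adds eight unsigned 8-bit lanes packed into one 64-bit word. It is not taken from less_slow.cpp or its pull request; the function names swar_add_u8x8 and scalar_add_u8x8 are hypothetical, and the sketch assumes wrap-around (modulo 256) arithmetic per lane.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: adds eight unsigned 8-bit lanes packed in 64-bit
// words. Each lane wraps modulo 256 and never carries into its neighbour.
std::uint64_t swar_add_u8x8(std::uint64_t a, std::uint64_t b) {
    constexpr std::uint64_t high_bits = 0x8080808080808080ULL;
    // Add only the low 7 bits of every lane. The largest per-lane result
    // is 127 + 127 = 254, so no carry can cross a lane boundary.
    const std::uint64_t low_sum = (a & ~high_bits) + (b & ~high_bits);
    // Bit 7 of low_sum holds the carry out of the low 7 bits. XOR it with
    // the top bits of both inputs to get the correct top bit of each lane.
    return low_sum ^ ((a ^ b) & high_bits);
}

// Hypothetical reference: the same operation done one byte at a time.
std::uint64_t scalar_add_u8x8(std::uint64_t a, std::uint64_t b) {
    std::uint64_t result = 0;
    for (int lane = 0; lane < 8; ++lane) {
        const auto x = static_cast<std::uint8_t>(a >> (8 * lane));
        const auto y = static_cast<std::uint8_t>(b >> (8 * lane));
        const auto sum = static_cast<std::uint8_t>(x + y);
        result |= static_cast<std::uint64_t>(sum) << (8 * lane);
    }
    return result;
}

// Compares the SWAR version against the scalar reference on one input pair.
void check_swar_add() {
    const std::uint64_t a = 0x01FF7F8000102030ULL;
    const std::uint64_t b = 0x0101018000F0E0D0ULL;
    assert(swar_add_u8x8(a, b) == scalar_add_u8x8(a, b));
}
```

The SWAR version replaces eight byte additions with a handful of 64-bit integer operations. Whether that pays off on a GPU, as the comment hypothesizes, depends on the hardware and data layout and is not something this sketch measures.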

C++ HPC related posts

  • Show HN: Less Slow C++

    20 projects | news.ycombinator.com | 18 Apr 2025
  • ChipStar: Run CUDA/Hip on SPIR-V via OpenCL/Level Zero

    1 project | news.ycombinator.com | 15 Dec 2023
  • An efficient C++17 GPU numerical computing library with Python-like syntax

    1 project | /r/programming | 5 Oct 2023
  • MatX: Efficient C++17 GPU numerical computing library with Python-like syntax

    1 project | /r/patient_hackernews | 5 Oct 2023
  • Learn WebGPU

    9 projects | news.ycombinator.com | 27 Apr 2023
  • Standard way of doing maths with arrays?

    2 projects | /r/cpp_questions | 9 Mar 2023
  • Blaze: High Performance Mathematics In C++

    2 projects | news.ycombinator.com | 16 Jan 2023

Index

What are some of the best open-source HPC projects in C++? This list will help you:

#    Project         Stars
1    ArrayFire       4,692
2    FluidX3D        4,414
3    mfem            1,885
4    less_slow.cpp   1,747
5    VkFFT           1,631
6    cccl            1,629
7    AdaptiveCpp     1,618
8    Boost.Compute   1,606
9    MatX            1,321
10   Trilinos        1,282
11   eve             1,188
12   RaftLib         965
13   Fastor          779
14   oneMath         672
15   relion          476
16   ginkgo          464
17   occa            418
18   blitz           413
19   alpaka          379
20   Umpire          357
21   BabelStream     335
22   qmcpack         334
23   nekRS           327
