C++ GPU

Open-source C++ projects categorized as GPU
Cuda CPP Opencl HPC Gpgpu

Top 23 C++ GPU Projects

  1. taichi

    Productive, portable, and performant GPU programming in Python.

    Project mention: Taichi: Productive, portable, and performant GPU programming in Python | news.ycombinator.com | 2024-08-20
  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. Open3D

    Open3D: A Modern Library for 3D Data Processing

  4. cudf

    cuDF - GPU DataFrame Library

    Project mention: Unleashing GPU Power: Supercharge Your Data Processing with cuDF | dev.to | 2024-06-21

    cuDF Documentation

  5. catboost

    A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

    Project mention: 🚀 Why Your ML Service Needs Rust + CatBoost: A Setup Guide That Actually Works | dev.to | 2025-01-19

    [package] name = "MLApp" version = "0.1.0" edition = "2021" [dependencies] catboost = { git = "https://github.com/catboost/catboost", rev = "0bfdc35"}

  6. cutlass

    CUDA Templates for Linear Algebra Subroutines

    Project mention: Memory and ILP handling in 2D convolutions | news.ycombinator.com | 2024-07-20

    Years ago I started a collection of convolution optimization resources: https://github.com/mratsim/laser/wiki/Convolution-optimisati...

    Also checked and apparently Nvidia Cutlass now supports generic convolutions: https://github.com/NVIDIA/cutlass

  7. meshoptimizer

    Mesh optimization library that makes meshes smaller and faster to render

    Project mention: UE5 Nanite in WebGPU | news.ycombinator.com | 2024-09-05

    > Making a nanite mesh is complicated, with a lot of internal offsets for linking, and so far only Unreal Engine's editor does it.

    meshoptimizer [1] is an OSS implementation of meshlet generation, which is what most people think of when they think of "Nanite's algorithm". Bevy, mentioned in a sibling reply, uses meshoptimizer as the generation tool.

    (Strictly speaking, "Nanite" is a brand name that encompasses a large collection of techniques, including meshlets, software rasterization, streaming, etc. For clarity during technical discussions, I prefer to talk about individual techniques, since they're really separate, even though they complement one another. For example, software rasterization can be used without meshlets if your triangles are really small. Streaming can be useful even if you aren't using meshlets. And so on.)

    [1]: https://github.com/zeux/meshoptimizer

  8. Halide

    a language for fast, portable data-parallel computation

    Project mention: Compiling Array Languages for SIMD [pdf] | news.ycombinator.com | 2025-02-12

    > Hence it becomes a game of scheduling. You already know what you need to optimise but actually doing so gets really hard really fast.

    This immediately makes me think of Halide, which was specifically invented to make this easier to do by decoupling the algorithm from the scheduler.

    Kind of sad that it doesn't see to have caught on much.

    [0] https://halide-lang.org/

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. DALI

    A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

    Project mention: Essential Deep Learning Checklist: Best Practices Unveiled | dev.to | 2024-06-17

    How to Accomplish: Use a combination of geometric transformations (e.g., rotation, scaling, cropping, flipping), color space adjustments (e.g., brightness, contrast, saturation), and other techniques (e.g., noise injection, blurring, cutout). Libraries such as ImgAug, DeepMind Augmentation, Albumentations, and NVIDIA DALI offer a wide range of ready-to-use augmentation techniques that can introduce the necessary diversity into your dataset.

  11. MegEngine

    MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架

  12. ArrayFire

    ArrayFire: a general purpose GPU library.

  13. cuml

    cuML - RAPIDS Machine Learning Library

  14. FluidX3D

    The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

    Project mention: FluidX3D | news.ycombinator.com | 2024-12-07
  15. tiny-cuda-nn

    Lightning fast C++/CUDA neural network framework

  16. heavydb

    HeavyDB (formerly OmniSciDB)

  17. executorch

    On-device AI across mobile, embedded and edge for PyTorch

    Project mention: PyTorch – Torchchat: Chat with LLMs Everywhere | news.ycombinator.com | 2024-07-31

    Did not know executorch existed! That's so cool! I have it on my bucket list to tinker with running LLMs on wearables after I'm a little further along in learning, great to see official tooling for that!

    https://github.com/pytorch/executorch

  18. deepdetect

    Deep Learning API and Server in C++14 support for PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE

  19. CV-CUDA

    CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

  20. GLSL-PathTracer

    A toy physically based GPU path tracer (C++/OpenGL/GLSL)

  21. Boost.Compute

    A C++ GPU Computing Library for OpenCL

  22. cccl

    CUDA Core Compute Libraries

    Project mention: Sorting Algorithm with CUDA | news.ycombinator.com | 2025-03-11
  23. MatX

    An efficient C++17 GPU numerical computing library with Python-like syntax

  24. marian

    Fast Neural Machine Translation in C++

  25. rpi-vk-driver

    VK driver for the Raspberry Pi (Broadcom Videocore IV)

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

C++ GPU discussion

Log in or Post with

C++ GPU related posts

Index

What are some of the best open-source GPU projects in C++? This list will help you:

# Project Stars
1 taichi 26,902
2 Open3D 12,105
3 cudf 8,809
4 catboost 8,300
5 cutlass 7,168
6 meshoptimizer 6,096
7 Halide 6,007
8 DALI 5,331
9 MegEngine 4,786
10 ArrayFire 4,654
11 cuml 4,547
12 FluidX3D 4,306
13 tiny-cuda-nn 3,933
14 heavydb 2,975
15 executorch 2,637
16 deepdetect 2,528
17 CV-CUDA 2,471
18 GLSL-PathTracer 1,904
19 Boost.Compute 1,591
20 cccl 1,553
21 MatX 1,298
22 marian 1,286
23 rpi-vk-driver 1,232

Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai

Did you know that C++ is
the 7th most popular programming language
based on number of references?