C++ GPU

Open-source C++ projects categorized as GPU
Topics: CUDA, C++, OpenCL, HPC, GPGPU

Top 23 C++ GPU Projects

  • taichi

    Productive, portable, and performant GPU programming in Python.

  • Project mention: CERN Root | news.ycombinator.com | 2024-06-01

    The haughtiness is not for nothing. Since Dec 2023, they made a lame excuse that Pytorch didn't support 3.12: https://github.com/taichi-dev/taichi/issues/8365#issuecommen...

    Later, even when Pytorch added support for 3.12, nothing changed (so far) in Taichi.

  • Open3D

    Open3D: A Modern Library for 3D Data Processing

  • Project mention: Does anyone else agree that the links to the latest development version of Open3D don't work? | /r/cscareerquestions | 2023-07-10

    I was going to file a bug about another issue, but I have to download the development version. This is why I want this solved quickly. None of the links seem to work: https://github.com/isl-org/Open3D/issues/6259

  • cudf

    cuDF - GPU DataFrame Library

  • Project mention: cuDF – GPU DataFrame Library | news.ycombinator.com | 2024-06-02

  • Halide

    a language for fast, portable data-parallel computation

  • Project mention: Show HN: Flash Attention in ~100 lines of CUDA | news.ycombinator.com | 2024-03-16

    If CPU/GPU execution speed is the goal while simultaneously code golfing the source size, https://halide-lang.org/ might have come in handy.

  • meshoptimizer

    Mesh optimization library that makes meshes smaller and faster to render

  • DALI

    A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

  • Project mention: [D] Will data augmentations work faster on TPUs? | /r/MachineLearning | 2023-12-07

    Another option is DALI (https://github.com/NVIDIA/DALI). For my project, while training EfficientNet2, it was a game changer. But it is way harder to implement in code than TorchVision or Kornia.

  • cutlass

    CUDA Templates for Linear Algebra Subroutines

  • Project mention: Optimization Techniques for GPU Programming [pdf] | news.ycombinator.com | 2023-08-09

    I would recommend the course from Oxford (https://people.maths.ox.ac.uk/gilesm/cuda/). Also explore the tutorial section of cutlass (https://github.com/NVIDIA/cutlass/blob/main/media/docs/cute/...) if you want to learn more about high performance gemm.

  • MegEngine

    MegEngine is a fast, scalable, and easy-to-use deep learning framework with support for automatic differentiation

  • ArrayFire

    ArrayFire: a general purpose GPU library.

  • cuml

    cuML - RAPIDS Machine Learning Library

  • Project mention: FLaNK Stack Weekly for 13 November 2023 | dev.to | 2023-11-13

  • tiny-cuda-nn

    Lightning fast C++/CUDA neural network framework

  • FluidX3D

    The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.

  • Project mention: FluidX3D | news.ycombinator.com | 2024-03-24

  • heavydb

    HeavyDB (formerly OmniSciDB)

  • deepdetect

    Deep Learning API and Server in C++14 with support for Caffe, PyTorch, TensorRT, Dlib, NCNN, TensorFlow, XGBoost and TSNE

  • Project mention: Exploring Open-Source Alternatives to Landing AI for Robust MLOps | dev.to | 2023-12-13

    For those seeking a lightweight solution for setting up deep learning REST APIs across platforms without the complexity of Kubernetes, Deepdetect is worth considering.

  • CV-CUDA

    CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

  • GLSL-PathTracer

    A toy physically based GPU path tracer (C++/OpenGL/GLSL)

  • Boost.Compute

    A C++ GPU Computing Library for OpenCL

  • executorch

    On-device AI across mobile, embedded and edge for PyTorch

  • Project mention: ExecuTorch: Enabling On-Device inference for embedded devices | news.ycombinator.com | 2023-10-17

    Yes, ExecuTorch is currently targeted at edge devices. The runtime is written in C++ with a 50KB binary size (without kernels) and should run on most platforms. You are right that we have not integrated with an Nvidia backend yet. Have you tried torch.compile() in PyTorch 2.0? It would do the Nvidia optimization for you without TorchScript. If you have specific binary size or edge-specific requests, feel free to file issues at https://github.com/pytorch/executorch/issues

  • rpi-vk-driver

    VK driver for the Raspberry Pi (Broadcom Videocore IV)

  • marian

    Fast Neural Machine Translation in C++

  • MatX

    An efficient C++17 GPU numerical computing library with Python-like syntax

  • Project mention: An efficient C++17 GPU numerical computing library with Python-like syntax | /r/programming | 2023-10-05

  • stdgpu

    stdgpu: Efficient STL-like Data Structures on the GPU

  • compute-runtime

    Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver

  • Project mention: Intel Graphics Compute Runtime for OneAPI Level Zero and OpenCL | news.ycombinator.com | 2023-08-02
NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source GPU projects in C++? This list will help you:

Rank Project Stars
1 taichi 24,930
2 Open3D 10,682
3 cudf 7,891
4 Halide 5,747
5 meshoptimizer 5,239
6 DALI 4,956
7 cutlass 4,739
8 MegEngine 4,734
9 ArrayFire 4,444
10 cuml 4,001
11 tiny-cuda-nn 3,508
12 FluidX3D 3,365
13 heavydb 2,914
14 deepdetect 2,500
15 CV-CUDA 2,234
16 GLSL-PathTracer 1,760
17 Boost.Compute 1,511
18 executorch 1,367
19 rpi-vk-driver 1,219
20 marian 1,186
21 MatX 1,129
22 stdgpu 1,109
23 compute-runtime 1,092
