Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises Learn more →
Top 23 C++ GPU Projects
Productive & portable high-performance programming in Python.Project mention: Taichi v1.5.0 Released! See what's new👇 | reddit.com/r/taichi_lang | 2023-04-17
Check our the realease note (https://github.com/taichi-dev/taichi/releases) for more improvements.
Open3D: A Modern Library for 3D Data ProcessingProject mention: Import many photogrammetry software's scenes into Blender | reddit.com/r/photogrammetry | 2023-03-26
Open3D (JSON, LOG, PLY) 1
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
cuDF - GPU DataFrame LibraryProject mention: A Polars exploration into Kedro | dev.to | 2023-05-17
The interesting thing about Polars is that it does not try to be a drop-in replacement to pandas, like Dask, cuDF, or Modin, and instead has its own expressive API. Despite being a young project, it quickly got popular thanks to its easy installation process and its “lightning fast” performance.
a language for fast, portable data-parallel computationProject mention: Two-tier programming language | reddit.com/r/ProgrammingLanguages | 2023-04-19
The C++ parallel algorithms library.Project mention: Parallel Computations in C++: Where Do I Begin? | reddit.com/r/learnprogramming | 2022-09-23
For a higher level GPU interface, Thrust provides "standard library"-like functions that run in parallel on the GPU (Nvidia only)
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.Project mention: DirectStorage - Loading data to GPU *directly* from the SSD drive, almost without using CPU | reddit.com/r/deeplearning | 2023-05-07
Check out https://github.com/nvidia/DALI
ONLYOFFICE Docs — document collaboration in your environment. Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises
Mesh optimization library that makes meshes smaller and faster to renderProject mention: Nanite-like LODs experiments | reddit.com/r/GraphicsProgramming | 2023-04-12
1) I used meshoptimizer to simplify meshes and building meshlets. But to combine these meshlets (for more efficient simplifying) I built a graph with weights and partition it. In this weight some metrics are included: the shared border length of a pair of meshlets, their facing direction and other similar metrics. The next step is to simplify these groups of meshlets (fixate borders for each of them and simplify meshlets within the group). So, you can make dependencies of newly generated meshlets on the initial ones.
ArrayFire: a general purpose GPU library.Project mention: Learn WebGPU | news.ycombinator.com | 2023-04-27
Loads of people have stated why easy GPU interfaces are difficult to create, but we solve many difficult things all the time.
Ultimately I think CPUs are just satisfactory for the vast vast majority of workloads. Servers rarely come with any GPUs to speak of. The ecosystem around GPUs is unattractive. CPUs have SIMD instructions that can help. There are so many reasons not to use GPUs. By the time anyone seriously considers using GPUs they're, in my imagination, typically seriously starved for performance, and looking to control as much of the execution details as possible. GPU programmers don't want an automagic solution.
So I think the demand for easy GPU interfaces is just very weak, and therefore no effort has taken off. The amount of work needed to make it as easy to use as CPUs is massive, and the only reason anyone would even attempt to take this on is to lock you in to expensive hardware (see CUDA).
For a practical suggestion, have you taken a look at https://arrayfire.com/ ? It can run on both CUDA and OpenCL, and it has C++, Rust and Python bindings.
cuML - RAPIDS Machine Learning LibraryProject mention: Is it possible to run Sklearn models on a GPU? | reddit.com/r/datascience | 2023-03-05
sklearn can't, bit take a look at cuML (https://github.com/rapidsai/cuml ). It uses the same API as sklearn but executes on GPU.
CUDA Templates for Linear Algebra SubroutinesProject mention: Want to understand INT8 better | reddit.com/r/CUDA | 2023-05-03
The latter (and I guess you were asking about this one) is designed to accelerate NN inference in reduced precision. It is possible to use Tensor Cores for you own purposes, mainly through CUTLASS. But because Tensor Cores are designed to execute matrix multiplications, it can be hard to adapt your problem to them. The performance with them is insane (IIRC 32x the performance of the INT32 pipeline), but only for matrix multiplication…
HeavyDB (formerly OmniSciDB)
Lightning fast C++/CUDA neural network frameworkProject mention: [D] Have their been any attempts to create a programming language specifically for machine learning? | reddit.com/r/MachineLearning | 2023-02-11
In the opposite direction from your question is a very interesting project, TinyNN all implemented as close to the metal as possible and very fast: https://github.com/NVlabs/tiny-cuda-nn
Deep Learning API and Server in C++14 support for Caffe, PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNEProject mention: [D] Deep Learning Framework for C++. | reddit.com/r/MachineLearning | 2022-06-12
But you need to have good reasons to do it. Ours is that we have a multi-backend framework, and that we don't want any step in between dev & run. C++ allows for this since the same code can run on training server and edge device as needed. It also allows for building full AI applicatioms with great performances (e g. real time) We dev & use https://github.com/jolibrain/deepdetect for these purposes and it serves us very well, but it's not the faint of heart !
The C++ Standard Library for your entire system.
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.Project mention: Microsoft, Tencent, Baidu Adopting Nvidia CV-CUDA for Computer Vision AI | news.ycombinator.com | 2023-03-21
I'm not familiar with CV-CUDA but it looks interesting.The github may be made useful than the press release: https://github.com/CVCUDA/CV-CUDA
A GLSL Path Tracer
A C++ GPU Computing Library for OpenCL
VK driver for the Raspberry Pi (Broadcom Videocore IV)Project mention: Failed to open Light Display Manager when booting | reddit.com/r/raspberry_pi | 2022-09-04
I was trying to install Vulkan on my Pi3B+ following this link
Fast Neural Machine Translation in C++Project mention: [P] A CLI tool for easy transformer sequence classifier training and inference | reddit.com/r/MachineLearning | 2023-02-01
As a reference, I forked https://github.com/marian-nmt/marian privately to support sequence tagging tasks. With a positional loss mask, It can also support sequence classificaiton.
Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ DriverProject mention: Vladmandic Stable Diffusion added Intel ARC GPU support on Linux | reddit.com/r/IntelArc | 2023-05-01
Update: I was able to fix my issue. I'm using Ubuntu 22.04.2 LTS and have the newest available kernel, 6.3.1. Installing the drivers via apt does not work, instead I needed to use https://github.com/intel/compute-runtime/releases/
stdgpu: Efficient STL-like Data Structures on the GPU
Tuned OpenCL BLASProject mention: OpenCL in Termux | reddit.com/r/termux | 2023-06-02
Install CLBlast: cd git clone https://github.com/CNugteren/CLBlast.git cd CLBlast cmake -B build \ -DBUILD_SHARED_LIBS=OFF \ -DTUNERS=OFF \ -DCMAKE_BUILD_TYPE=Release \ -DCMAKE_INSTALL_PREFIX=/data/data/com.termux/files/usr cd build make -j8 make install
Write Clean C++ Code. Always.. Sonar helps you commit clean C++ code every time. With over 550 unique rules to find C++ bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
C++ GPU related posts
mtensor: a c++/cuda template library for tensor lazy evaluation
1 project | reddit.com/r/programming | 10 May 2023
Vladmandic Stable Diffusion added Intel ARC GPU support on Linux
3 projects | reddit.com/r/IntelArc | 1 May 2023
Pytorch Enable MPS Fallback help?
1 project | reddit.com/r/pytorch | 29 Apr 2023
9 projects | news.ycombinator.com | 27 Apr 2023
Two-tier programming language
6 projects | reddit.com/r/ProgrammingLanguages | 19 Apr 2023
What's your main programming language?
3 projects | reddit.com/r/ScientificComputing | 19 Apr 2023
PyG - PyTorch Geometric for GNNs
1 project | reddit.com/r/engineering_stuff | 19 Apr 2023
A note from our sponsor - ONLYOFFICE
www.onlyoffice.com | 2 Jun 2023
What are some of the best open-source GPU projects in C++? This list will help you: