Top 23 C++ GPU Projects
-
Project mention: [D] PyTorch processes taking up tons of GPU memory - any way to reduce this? | reddit.com/r/MachineLearning | 2022-05-25
Maybe related: https://github.com/pytorch/pytorch/issues/12873
-
Project mention: Programming language for high performance simulations. Is there anything like this already? | reddit.com/r/ProgrammingLanguages | 2022-01-03
What level of simulation are you thinking? If it's low-level (rigid body physics, soft body physics, fluid, etc.) there's (taichi)[https://github.com/taichi-dev/taichi], which turns (a subset of) Python into high perf gpu code.
-
Scout APM
Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.
-
Project mention: Creating a simple 3d engine to render a car for ML, how hard is it? Sim | reddit.com/r/cpp | 2021-12-16
http://www.open3d.org/ seems like a good choice. Is there anything better I'm missing?
-
Project mention: Currently, what are some of the worst things about C++? | reddit.com/r/cpp | 2022-04-28
I also maintain the build system for Halide, but as it's less greenfield, there are some things I would need team buy-in to change. It's also much more complex (building code generators necessarily is). I'm still pretty happy with it, though. See here: https://github.com/halide/Halide
-
-
Users should work with higher level primitives like tasks, parallel loops, asynchronous functions etc. Think TBB, Thrust, Taskflow, lparallel for CL, etc.
-
DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Project mention: [D] Efficiently loading videos in PyTorch without extracting frames | reddit.com/r/MachineLearning | 2021-10-26 -
SonarQube
Static code analysis for 29 languages.. Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.
-
Project mention: [D] Neural Networks using a generic GPU framework | reddit.com/r/MachineLearning | 2022-01-04
Looking for frameworks with Julia + OpenCL I found array fire. It seems quite good, bonus points for rust bindings. I will keep looking for more, Julia completely fell off my radar.
-
Project mention: Algoritm to simplify geometric data similar to texture bitmapping | reddit.com/r/GraphicsProgramming | 2021-12-14
I've used the "Simplification" operation of meshoptimizer to simplify 3D model geometry: https://github.com/zeux/meshoptimizer
-
Project mention: OmniSciDB – SQL database leveraging GPUs for performance | news.ycombinator.com | 2022-02-01
-
deepdetect
Deep Learning API and Server in C++14 support for Caffe, PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE
-
If you are thinking of using new features through https://github.com/NVIDIA/libcudacxx , you'll have to learn c++
-
-
Even RPi Zero has (unofficial) Vulkan subset https://github.com/Yours3lf/rpi-vk-driver but unfortunately it lacks compute at all (and SPIR-V, GPU semaphore, 32bit indices for fixed graphics pipeline, ...)
-
-
Project mention: HW transcoding not working (Ubuntu 20 LTS, docker linuxserver) | reddit.com/r/PleX | 2022-05-28
-
-
Project mention: The Next Platform: "Intel Takes The SYCL To Nvidia's CUDA With Migration Tool" | reddit.com/r/hardware | 2022-05-20
Yup. SYCL is the future: https://github.com/illuhad/hipSYCL
-
Project mention: Integrating the CUDA APIs (Driver, Runtime, JIT) in pleasant modern-C++ wrappers | news.ycombinator.com | 2022-03-26
-
-
-
-
mixbench
A GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Project mention: Intel Nukes Alder Lake's AVX-512 Support, Now Fuses It Off in Silicon | reddit.com/r/hardware | 2022-03-02The results I get match the FLOPS figures stated for the respective GPUs, so presumably I can't be memory bound or similar. But if you're still in doubt, I was using this code, comparing the single precision and integer kernels, so let me know any issues you see with the benchmark.
C++ GPU related posts
- HW transcoding not working (Ubuntu 20 LTS, docker linuxserver)
- [D] PyTorch processes taking up tons of GPU memory - any way to reduce this?
- Another (?) Plex & Intel Compute Runtime Thread
- New PUBLIC PMS Version Available - 1.26.1.5798-99a4a6ac9
- Getting amount of free GPU memory on Intel GPUs
- It's time to renew 3DS emulation: We're back!
- I studied electronic engineering for six years and don't know half the languages mentioned in this sub
Index
What are some of the best open-source GPU projects in C++? This list will help you:
Project | Stars | |
---|---|---|
1 | Pytorch | 56,179 |
2 | taichi | 19,092 |
3 | Open3D | 6,766 |
4 | Halide | 4,938 |
5 | cudf | 4,768 |
6 | Thrust | 3,917 |
7 | DALI | 3,864 |
8 | ArrayFire | 3,826 |
9 | meshoptimizer | 3,256 |
10 | heavydb | 2,668 |
11 | deepdetect | 2,363 |
12 | libcudacxx | 1,974 |
13 | Boost.Compute | 1,290 |
14 | rpi-vk-driver | 1,190 |
15 | GLSL-PathTracer | 1,143 |
16 | compute-runtime | 767 |
17 | stdgpu | 689 |
18 | hipSYCL | 535 |
19 | cuda-api-wrappers | 471 |
20 | neoGFX | 438 |
21 | oneMKL | 306 |
22 | vuh | 301 |
23 | mixbench | 179 |
Are you hiring? Post a new remote job listing for free.