vuh
ParallelReductionsBenchmark
Our great sponsors
vuh | ParallelReductionsBenchmark | |
---|---|---|
3 | 2 | |
340 | 59 | |
- | - | |
2.8 | 4.6 | |
6 months ago | 5 months ago | |
C++ | C++ | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vuh
-
GLSL shaders for OpenCL
The only one that is relatively lightweight (and doesn't have dependencies) is vuh (https://github.com/Glavnokoman/vuh) that looks unsupported.
- Resources for Vulkan GPGPU searched
-
Blender 3.0 takes support for AMD GPUs to the next level
Don't try to use it directly, you need a higher level library.
https://github.com/KomputeProject/kompute seems like what's taking off. There are also smaller ones like https://github.com/Glavnokoman/vuh etc.
ParallelReductionsBenchmark
-
Failing to Reach 204 GB/S DDR4 Bandwidth
For the single threaded version, they have a data hazard on the sums that could be smoothed out with a little loop unrolling and separate variables.
But in the [threaded version](https://github.com/unum-cloud/ParallelReductions/blob/fd16d9...) they have separate slots for an accumulator but it's still in a shared vector, which most likely has the issue I described.
What are some alternatives?
ocl - OpenCL for Rust
MatX - An efficient C++17 GPU numerical computing library with Python-like syntax
GLSL - GLSL Shading Language Issue Tracker
ispc - Intel® Implicit SPMD Program Compiler
rust-gpu - 🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧
gpuowl - GPU Mersenne primality test.
cuda-api-wrappers - Thin C++-flavored header-only wrappers for core CUDA APIs: Runtime, Driver, NVRTC, NVTX.
alpaka - Abstraction Library for Parallel Kernel Acceleration :llama:
kompute - General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.
cuda_memtest - Fork of CUDA GPU memtest :eyeglasses:
uVkCompute - A micro Vulkan compute pipeline and a collection of benchmarking compute shaders
eaminer - Heterogeneous Ethereum Miner with support for AMD, Intel and Nvidia GPUs using SYCL, OpenCL and CUDA backends