libGPGPU
Boost.Compute
libGPGPU | Boost.Compute | |
---|---|---|
4 | - | |
8 | 1,506 | |
- | 1.1% | |
8.0 | 0.0 | |
5 months ago | 18 days ago | |
C++ | C++ | |
MIT License | Boost Software License 1.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
libGPGPU
- Can anyone with Arc750 or 770 try this FMA benchmark written with OpenCL? I want to compare gigaflops.
-
I made a fused-multiply-add test program that distributes work onto all GPUs, CPUs, iGPU of computer. As a result, it outputs the total combined GFLOPS value of the computer.
libGPGPU/fma_benchmark.rar at master · tugrul512bit/libGPGPU · GitHub
-
AMD and Intel Battle for Windows 11 AI Acceleration Lead
16000x16000 mandelbrot generator with 50 iterations per pixel takes only 50 milliseconds using gt1030+iGpu+ryzen 7900 https://github.com/tugrul512bit/libGPGPU
Boost.Compute
We haven't tracked posts mentioning Boost.Compute yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
qlora - QLoRA: Efficient Finetuning of Quantized LLMs
Thrust - [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
compute-runtime - Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
moodycamel - A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
Cekirdekler - Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
ArrayFire - ArrayFire: a general purpose GPU library.
EpicWarCL - C# fully OpenCL(C99)-accelerated game demo and benchmark, prealpha- stage abondonware.
C++React - C++React: A reactive programming library for C++11.
HPX - The C++ Standard Library for Parallelism and Concurrency
Bolt - Bolt is a C++ template library optimized for GPUs. Bolt provides high-performance library implementations for common algorithms such as scan, reduce, transform, and sort.
VexCL - VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP