Top 23 C++ OpenCL Projects
-
mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
-
Loads of people have stated why easy GPU interfaces are difficult to create, but we solve many difficult things all the time.
Ultimately I think CPUs are just satisfactory for the vast vast majority of workloads. Servers rarely come with any GPUs to speak of. The ecosystem around GPUs is unattractive. CPUs have SIMD instructions that can help. There are so many reasons not to use GPUs. By the time anyone seriously considers using GPUs they're, in my imagination, typically seriously starved for performance, and looking to control as much of the execution details as possible. GPU programmers don't want an automagic solution.
So I think the demand for easy GPU interfaces is just very weak, and therefore no effort has taken off. The amount of work needed to make it as easy to use as CPUs is massive, and the only reason anyone would even attempt to take this on is to lock you in to expensive hardware (see CUDA).
For a practical suggestion, have you taken a look at https://arrayfire.com/ ? It can run on both CUDA and OpenCL, and it has C++, Rust and Python bindings.
-
For completeness, it looks like this question was posted to the oneDNN GitHub repo and the response was to stay tuned for updates.
-
Project mention: Vladmandic Stable Diffusion added Intel ARC GPU support on Linux | reddit.com/r/IntelArc | 2023-05-01
Update: I was able to fix my issue. I'm using Ubuntu 22.04.2 LTS and have the newest available kernel, 6.3.1. Installing the drivers via apt does not work; instead, I needed to use https://github.com/intel/compute-runtime/releases/
-
The Zero 2 still uses VideoCore IV, for which we have the documentation and various projects making use of it. There is an open source Mesa driver and also OpenCL support: https://github.com/doe300/VC4CL
-
Project mention: AMD's HIPRT Working Its Way To Blender With ~25% Faster Rendering | reddit.com/r/Amd | 2023-03-24
In fact OpenSYCL was initially called hipSYCL because it is based on AMD's ROCm/HIP. AMD had hipSYCL code running on the Frontier supercomputer at least four years ago and continues to support it.
-
I tried a very similar thing. My goal was to run llama-cpp-python with CLBlast GPU acceleration via clvk on VulkanSDK on my M1 Max computer. I downloaded VulkanSDK for macOS and cloned clvk (https://github.com/kpet/clvk) and CLBlast. The build was successful, but there is a problem: when the clCreateCommandQueue function was called with the CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE option (in ggml-opencl.c of llama.cpp), an error occurred and I do not know how to handle it.
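One possible way to handle a failure like the one described above is to retry the queue creation without the out-of-order flag. The sketch below is an assumption, not a description of what llama.cpp or clvk does: `create_queue_with_fallback`, `QueueResult`, and the `k...` constants are hypothetical names, and the constants only mirror the values in `CL/cl.h` so that the example compiles without the OpenCL headers. It also assumes the failure is reported as `CL_INVALID_VALUE` or `CL_INVALID_QUEUE_PROPERTIES`, the two codes the OpenCL specification lists for invalid or unsupported queue properties; the post does not say which error was returned.

```cpp
#include <cstdint>

// Stand-ins mirroring CL/cl.h so the sketch builds without OpenCL installed.
// In real code, include the OpenCL header and use its names instead.
using QueueProps = std::uint64_t;               // cl_command_queue_properties
constexpr int kSuccess = 0;                     // CL_SUCCESS
constexpr int kInvalidValue = -30;              // CL_INVALID_VALUE
constexpr int kInvalidQueueProperties = -35;    // CL_INVALID_QUEUE_PROPERTIES
constexpr QueueProps kOutOfOrder = 1u << 0;     // CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE

// Hypothetical result type: the queue handle plus what actually happened.
struct QueueResult {
    void* queue;        // would be a cl_command_queue in real code
    int error;          // kSuccess, or the last error reported by the creator
    bool out_of_order;  // true only if the out-of-order queue was created
};

// Hypothetical helper. `create` is any callable shaped like
//   void* create(QueueProps props, int* err)
// which in real code would wrap clCreateCommandQueue(context, device, props, err).
template <typename CreateFn>
QueueResult create_queue_with_fallback(CreateFn&& create) {
    int err = kSuccess;

    // First attempt: ask for out-of-order execution.
    void* queue = create(kOutOfOrder, &err);
    if (err == kSuccess) {
        return {queue, err, true};
    }

    // Retry only when the error points at the properties argument.
    if (err == kInvalidValue || err == kInvalidQueueProperties) {
        err = kSuccess;
        queue = create(QueueProps{0}, &err);  // plain in-order queue
        return {queue, err, false};
    }

    // Any other error is unrelated to the flag, so report it unchanged.
    return {nullptr, err, false};
}
```

The caller can check `out_of_order` to learn which kind of queue it received. Since in-order execution is the default queue mode in OpenCL, dropping the flag changes scheduling but not the results of correctly synchronized code.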
-
Project mention: OpenCL 3.0.12 Released With Command Buffers Mutable Dispatch Extension and Enhanced Layers Support | reddit.com/r/OpenCL | 2022-09-22
-
I wrote the Fortran version of BabelStream (https://github.com/UoB-HPC/BabelStream/pull/135) to compare all of these, which might be useful as well. There is also MiniWeather, which supports all of OpenMP, OpenACC and StdPar for GPU: https://github.com/mrnorman/miniWeather/tree/main/fortran.
-
mixbench
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
-
Project mention: Linus Tech Tips: "China doesn't want me to have this GPU [Moore Threads MTT S80]" (Linus Tech Tips Reviews the Moore Threads MTT S80 GPU) | reddit.com/r/hardware | 2023-03-12
I know PyTorch supports OpenCL now and you can do training with it as well. See here. I have never tried it myself.
-
Project mention: [P] OpenCL backend for PyTorch - progress works with mainstream pytorch | reddit.com/r/MachineLearning | 2022-10-31
I'm working on a PyTorch OpenCL backend based on the dlprimitives core library. It has existed for a while, but until now it required building a custom PyTorch version.
-
xreg
Library and executables for modeling and registration applications in medical image analysis. Particular emphasis on intraoperative fluoroscopic (X-ray) navigation via 2D/3D registration.
-
chip-spv
CHIP-SPV is a backend infrastructure for HIP/CUDA running on SPIR-V via OpenCL or Level Zero APIs.
Project mention: In the next 5 years, what do you think can push OpenCL adoption? | reddit.com/r/OpenCL | 2023-04-27
Regarding the second item (CUDA to OpenCL), have a look at hipstar: https://github.com/CHIP-SPV/hipstar
-
ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SYCL - all it takes to sum a lot of numbers fast!
C++ OpenCL related posts
- LangChain / LlamaCpp on M1 GPU (MPS)?
- Vladmandic Stable Diffusion added Intel ARC GPU support on Linux
- Learn WebGPU
- In the next 5 years, what do you think can push OpenCL adoption?
- Intel Arc Driver Overhead - Just a Myth?
- How do you allocate more than 4GB of memory for OpenCL in A770 16GB?
- Proxmox iGPU passthrough to LXC not working
-
A note from our sponsor - CodiumAI
codium.ai | 31 May 2023
Index
What are some of the best open-source OpenCL projects in C++? This list will help you:
# | Project | Stars |
---|---|---|
1 | mace | 4,775 |
2 | ArrayFire | 4,145 |
3 | oneDNN | 3,164 |
4 | Boost.Compute | 1,411 |
5 | compute-runtime | 923 |
6 | VC4CL | 710 |
7 | OpenSYCL | 707 |
8 | VexCL | 688 |
9 | amgcl | 618 |
10 | triSYCL | 415 |
11 | xmrig-amd | 388 |
12 | clvk | 266 |
13 | opencl-intercept-layer | 257 |
14 | AutoDock-GPU | 251 |
15 | BabelStream | 234 |
16 | mixbench | 233 |
17 | RayTracing | 218 |
18 | pytorch_dlprim | 126 |
19 | dlprimitives | 106 |
20 | gpuowl | 83 |
21 | xreg | 65 |
22 | chip-spv | 61 |
23 | ParallelReductionsBenchmark | 55 |