SaaSHub helps you find the best software and product alternatives Learn more →
Top 12 C++ Sycl Projects
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
AdaptiveCpp
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
-
mixbench
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SyCL - all it takes to sum a lot of numbers fast!
-
eaminer
Heterogeneous Ethereum Miner with support for AMD, Intel and Nvidia GPUs using SYCL, OpenCL and CUDA backends
If you are talking about non-small matrix multiplication in MKL, is now in opensource as a part of oneDNN. It literally has exactly the same code, as in MKL (you can see this by inspecting constants or doing high-precision benchmarks).
For small matmul there is libxsmm. It may take tremendous efforts make something faster than oneDNN and libxsmm, as jit-based approach of https://github.com/oneapi-src/oneDNN/blob/main/src/gpu/jit/g... is too flexible: if someone finds a better sequence, oneDNN can reuse it without major change of design.
But MKL is not limited to matmul, I understand it...
Project mention: What Every Developer Should Know About GPU Computing | news.ycombinator.com | 2023-10-21Sapphire Rapids is a CPU.
AMD's primary focus for a GPU software ecosystem these days seems to be implementing CUDA with s/cuda/hip, so AMD directly supports and encourages running GPU software written in CUDA on AMD GPUs.
The only implementation for sycl on AMD GPUs that I can find is a hobby project that apparently is not allowed to use either the 'hip' or 'sycl' names. https://github.com/AdaptiveCpp/AdaptiveCpp
Project mention: For those interested in learning how to build a Language Identification solution using PyTorch, check out my article. | /r/learnmachinelearning | 2023-04-28Link to code sample: https://github.com/oneapi-src/oneAPI-samples/tree/master/AI-and-Analytics/End-to-end-Workloads/LanguageIdentification
Project mention: Portable and vendor neutral parallel programming on heterogeneous platforms | news.ycombinator.com | 2024-04-11
Project mention: Data Parallel Extensions for Python: near-native speed for scientific computing | news.ycombinator.com | 2023-11-24Considering how poorly it seems to support cuda as a backend [0], I wouldn't hold my breath about non intel vendor support (amd cpu or gpu). As for less common gpus, there really is no good support in any library. If you ever want to go down a fun rabbit hole, try to use the gpu in a raspberry pi for something. You'll eventually find one guy who reverse engineered the drivers to make a compiler but that's it.
C++ Sycl related posts
- What Every Developer Should Know About GPU Computing
- Offloading standard C++ PSTL to Intel, NVIDIA and AMD GPUs with AdaptiveCpp
- AMD's HIPRT Working Its Way To Blender With ~25% Faster Rendering
- hipSYCL can now generate a binary that runs on any Intel/NVIDIA/AMD GPU - in a single compiler pass. It is now the first single-pass SYCL compiler, and the first with unified code representation across backends.
- Bringing Nvidia® and AMD support to oneAPI
- Intel oneAPI 2023 Released - AMD & NVIDIA Plugins Available
- The Next Platform: "Intel Takes The SYCL To Nvidia's CUDA With Migration Tool"
-
A note from our sponsor - SaaSHub
www.saashub.com | 19 Apr 2024
Index
What are some of the best open-source Sycl projects in C++? This list will help you:
Project | Stars | |
---|---|---|
1 | GLM | 8,653 |
2 | oneDNN | 3,446 |
3 | AdaptiveCpp | 1,037 |
4 | oneAPI-samples | 830 |
5 | triSYCL | 436 |
6 | occa | 379 |
7 | mixbench | 330 |
8 | BabelStream | 307 |
9 | dpctl | 90 |
10 | ParallelReductionsBenchmark | 59 |
11 | gtensor | 33 |
12 | eaminer | 4 |