oneMKL
CuTeLib
| | oneMKL | CuTeLib |
|---|---|---|
| Mentions | 2 | 1 |
| Stars | 565 | 0 |
| Growth | 3.7% | - |
| Activity | 8.5 | 7.8 |
| Last commit | 10 days ago | over 2 years ago |
| Language | C++ | C++ |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
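The page does not publish the exact formulas behind these numbers, so the following is only a minimal sketch of how such metrics could be computed. The function names, the exponential half-life weighting, and the rank-based 0 to 10 scale are all assumptions made for illustration, not the site's actual method.

```cpp
#include <algorithm>
#include <cmath>
#include <limits>
#include <vector>

// Hypothetical recency weighting (an assumption, not the site's formula):
// a commit made `age` days ago contributes 0.5^(age / half_life_days),
// so recent commits count more than older ones.
double weighted_commit_score(const std::vector<double>& commit_ages_days,
                             double half_life_days = 30.0) {
    double score = 0.0;
    for (double age : commit_ages_days) {
        score += std::pow(0.5, age / half_life_days);
    }
    return score;
}

// Hypothetical relative scale: 10 times the fraction of tracked projects
// whose raw score is strictly lower. A result of 9.0 then means the project
// sits in the top 10% of tracked projects.
double activity_from_rank(double raw, const std::vector<double>& all_raw) {
    if (all_raw.empty()) {
        return 0.0;
    }
    const auto below = std::count_if(all_raw.begin(), all_raw.end(),
                                     [raw](double r) { return r < raw; });
    return 10.0 * static_cast<double>(below) /
           static_cast<double>(all_raw.size());
}

// Month-over-month growth in stars, in percent. Returns NaN when the
// previous count is zero, since the ratio is then undefined (the table
// shows "-" in that situation).
double star_growth_percent(double stars_prev, double stars_now) {
    if (stars_prev <= 0.0) {
        return std::numeric_limits<double>::quiet_NaN();
    }
    return 100.0 * (stars_now - stars_prev) / stars_prev;
}
```

Under these assumptions, a project that outscores 9 of 10 tracked projects gets an activity of 9.0, and a project with no stars last month has no defined growth figure.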
oneMKL
- Stable Diffusion on AMD RDNA™ 3 Architecture
  I think there's already been work done to just use Intel MKL on any device: https://github.com/oneapi-src/oneMKL
- Developing in heterogeneous environment with the best HPC libraries
CuTeLib
- Guidelines for using raw pointers in modern C++ and GPUs
  I am building a similar library, including copy and streams and so on. Check it out: https://github.com/anders-wind/CuTeLib
What are some alternatives?
oneDNN - oneAPI Deep Neural Network Library (oneDNN)
taco - The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
kokkos-kernels - Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
stlbm
peakperf - Achieve peak performance on x86 CPUs and NVIDIA GPUs
mtensor - a c++/cuda template library for tensor lazy evaluation
nekRS - our next generation fast and scalable CFD code
blitz - Blitz++ Multi-Dimensional Array Library for C++
ArrayFire - ArrayFire: a general purpose GPU library.
CHAI - Copy-hiding array abstraction to automatically migrate data between memory spaces
monolish - monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
CppCoreGuidelines - The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++