oneMKL
LSQR-CUDA
oneMKL | LSQR-CUDA | |
---|---|---|
2 | 1 | |
567 | 12 | |
1.6% | - | |
8.5 | 1.3 | |
3 days ago | 12 months ago | |
C++ | Cuda | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
oneMKL
-
Stable Diffusion on AMD RDNA™ 3 Architecture
I think there's already been work done to just use intel MKL on any device: https://github.com/oneapi-src/oneMKL
- Developing in heterogeneous environment with the best HPC libraries
LSQR-CUDA
-
CUDA LSQR Solver
Repository here
What are some alternatives?
oneDNN - oneAPI Deep Neural Network Library (oneDNN)
pyopencl - OpenCL integration for Python, plus shiny features
kokkos-kernels - Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
cub - [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
peakperf - Achieve peak performance on x86 CPUs and NVIDIA GPUs
CUDA-Guide - CUDA Guide
nekRS - our next generation fast and scalable CFD code
ArrayFire - ArrayFire: a general purpose GPU library.
monolish - monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Taskflow - A General-purpose Parallel and Heterogeneous Task Programming System
CuTeLib - CUDA Template Library provides simple, typesafe, performant constructs for C++ CUDA projects
RandN - Better random number generation for .NET