| | LSQR-CUDA | oneMKL |
|---|---|---|
| Mentions | 1 | 2 |
| Stars | 12 | 567 |
| Growth | - | 1.6% |
| Activity | 1.3 | 8.5 |
| Last commit | 12 months ago | 4 days ago |
| Language | Cuda | C++ |
| License | MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub.
Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
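As a rough illustration of the two metrics described above, the sketch below computes a month-over-month star growth percentage and a recency-weighted commit score. The function names, the exponential decay, and the 30-day half-life are assumptions made for this example; the site's actual activity formula is not stated here, and the star counts used are made-up inputs, not data from either project.

```python
from datetime import date


def month_over_month_growth(stars_prev: int, stars_now: int) -> float:
    """Return the percentage change in stars from one month to the next."""
    if stars_prev <= 0:
        raise ValueError("stars_prev must be positive to compute growth")
    return 100.0 * (stars_now - stars_prev) / stars_prev


def recency_weighted_activity(commit_dates, today, half_life_days=30.0):
    """Sum per-commit weights that halve every `half_life_days`.

    This decay scheme is an assumption chosen for illustration: it only
    demonstrates the idea that recent commits count more than older ones.
    """
    return sum(
        0.5 ** ((today - d).days / half_life_days)
        for d in commit_dates
    )


if __name__ == "__main__":
    # Hypothetical star counts: 500 last month, 508 this month.
    print(round(month_over_month_growth(500, 508), 1))

    # Hypothetical commit history: one commit today, one 30 days ago.
    today = date(2024, 1, 31)
    commits = [date(2024, 1, 31), date(2024, 1, 1)]
    print(recency_weighted_activity(commits, today))
```

Under this scheme a commit made today contributes a weight of 1 and a commit one half-life old contributes 0.5, so a project with a steady stream of recent commits scores higher than one whose commits are all months old.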
LSQR-CUDA
- CUDA LSQR Solver
  Repository here
oneMKL
- Stable Diffusion on AMD RDNA™ 3 Architecture
  I think there's already been work done to just use intel MKL on any device: https://github.com/oneapi-src/oneMKL
- Developing in heterogeneous environment with the best HPC libraries
What are some alternatives?
pyopencl - OpenCL integration for Python, plus shiny features
oneDNN - oneAPI Deep Neural Network Library (oneDNN)
cub - [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
kokkos-kernels - Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
CUDA-Guide - CUDA Guide
peakperf - Achieve peak performance on x86 CPUs and NVIDIA GPUs
nekRS - our next generation fast and scalable CFD code
ArrayFire - ArrayFire: a general purpose GPU library.
monolish - monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Taskflow - A General-purpose Parallel and Heterogeneous Task Programming System
CuTeLib - CUDA Template Library provides simple, typesafe, performant constructs for C++ CUDA projects
RandN - Better random number generation for .NET