| | hipBLAS | rocPRIM |
|---|---|---|
| Mentions | 3 | 1 |
| Stars | 100 | 145 |
| Growth | - | 4.8% |
| Activity | 9.3 | 8.5 |
| Last commit | 6 days ago | 10 days ago |
| Language | C++ | C++ |
| License | GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
hipBLAS
- A 100x speedup with unsafe Python
- How do you build Apps with hipblas using CMake?
For example: the simple SGEMM example can be manually compiled on either platform simply as:
- How do you build hipblas using CMake?
I'm trying to build the SGEMM example in the library using CMake v3.16, but I'm just stuck ...
rocPRIM
- Nvidia in the Valley
> AMD doesn't have a library of warp-level/kernel-level/global "software primitives" like Cuda Unbound or Thrust either.
The ROCm software primitives library is rocPRIM, and the ROCm equivalent to Thrust is rocThrust.
https://github.com/ROCmSoftwarePlatform/rocPRIM
What are some alternatives?
hipCUB - Reusable software components for ROCm developers
core - Decentralized Fog Computing Platform
AdaptiveCpp - Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
Bolt - Bolt is a C++ template library optimized for GPUs. Bolt provides high-performance library implementations for common algorithms such as scan, reduce, transform, and sort.
alpaka - Abstraction Library for Parallel Kernel Acceleration
rocThrust - ROCm Thrust - run Thrust dependent software on AMD GPUs
stdgpu - Efficient STL-like Data Structures on the GPU