MatX
ginkgo
MatX | ginkgo | |
---|---|---|
7 | 2 | |
1,117 | 373 | |
1.2% | 1.3% | |
9.1 | 9.8 | |
3 days ago | 6 days ago | |
C++ | C++ | |
BSD 3-clause "New" or "Revised" License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
MatX
- An efficient C++17 GPU numerical computing library with Python-like syntax
-
MatX: Efficient C++17 GPU numerical computing library with Python-like syntax
Hi, what specifically are you looking to benchmark on the K80? Users are free to contribute and we've had many external PRs.
Contribution guide is here: https://github.com/NVIDIA/MatX/blob/main/CONTRIBUTING.md
-
Blaze: High Performance Mathematics In C++
For GPU support take a look at our library:
https://github.com/NVIDIA/MatX
If anything is missing we're happy to take feature requests.
- C++ for scientific programming?
ginkgo
- AMD HIP + Cuda in same program
-
Incorporating abidiff into CI?
I had exactly the same thought after watching the video (though I wanted to do this for a while anyways) and did exactly that :) I found it interesting to see how different changes impact the ABI of our library, even though we don't promise ABI compatibility or anything. We add the head of the diff part to a PR comment, and store everything else as a build artifact. See the corresponding PR here: https://github.com/ginkgo-project/ginkgo/pull/922
What are some alternatives?
ParallelReductionsBenchmark - Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SyCL - all it takes to sum a lot of numbers fast!
HIP - HIP: C++ Heterogeneous-Compute Interface for Portability
GPU-accelerated-guppy-basecalling - GPU-accelerated guppy basecalling and demultiplexing on Linux
alpaka - Abstraction Library for Parallel Kernel Acceleration :llama:
cuda_memtest - Fork of CUDA GPU memtest :eyeglasses:
arbor - The Arbor multi-compartment neural network simulation library.
Trilinos - Primary repository for the Trilinos Project
stdgpu - stdgpu: Efficient STL-like Data Structures on the GPU
conan - Conan - The open-source C and C++ package manager
AdaptiveCpp - Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
conan-center-index - Recipes for the ConanCenter repository
Halide - a language for fast, portable data-parallel computation