ginkgo
MatX
ginkgo | MatX | |
---|---|---|
2 | 7 | |
375 | 1,117 | |
1.9% | 1.2% | |
9.8 | 9.1 | |
2 days ago | 6 days ago | |
C++ | C++ | |
BSD 3-clause "New" or "Revised" License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ginkgo
- AMD HIP + Cuda in same program
-
Incorporating abidiff into CI?
I had exactly the same thought after watching the video (though I wanted to do this for a while anyways) and did exactly that :) I found it interesting to see how different changes impact the ABI of our library, even though we don't promise ABI compatibility or anything. We add the head of the diff part to a PR comment, and store everything else as a build artifact. See the corresponding PR here: https://github.com/ginkgo-project/ginkgo/pull/922
MatX
- An efficient C++17 GPU numerical computing library with Python-like syntax
-
MatX: Efficient C++17 GPU numerical computing library with Python-like syntax
Hi, what specifically are you looking to benchmark on the K80? Users are free to contribute and we've had many external PRs.
Contribution guide is here: https://github.com/NVIDIA/MatX/blob/main/CONTRIBUTING.md
-
Blaze: High Performance Mathematics In C++
For GPU support take a look at our library:
https://github.com/NVIDIA/MatX
If anything is missing we're happy to take feature requests.
- C++ for scientific programming?
What are some alternatives?
HIP - HIP: C++ Heterogeneous-Compute Interface for Portability
ParallelReductionsBenchmark - Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SyCL - all it takes to sum a lot of numbers fast!
alpaka - Abstraction Library for Parallel Kernel Acceleration :llama:
GPU-accelerated-guppy-basecalling - GPU-accelerated guppy basecalling and demultiplexing on Linux
arbor - The Arbor multi-compartment neural network simulation library.
cuda_memtest - Fork of CUDA GPU memtest :eyeglasses:
stdgpu - stdgpu: Efficient STL-like Data Structures on the GPU
Trilinos - Primary repository for the Trilinos Project
AdaptiveCpp - Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
conan - Conan - The open-source C and C++ package manager
Halide - a language for fast, portable data-parallel computation
conan-center-index - Recipes for the ConanCenter repository