oneMKL
peakperf
Our great sponsors
oneMKL | peakperf | |
---|---|---|
2 | 2 | |
565 | 56 | |
3.7% | - | |
8.5 | 4.4 | |
10 days ago | 2 months ago | |
C++ | C++ | |
Apache License 2.0 | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
oneMKL
-
Stable Diffusion on AMD RDNA™ 3 Architecture
I think there's already been work done to just use intel MKL on any device: https://github.com/oneapi-src/oneMKL
- Developing in heterogeneous environment with the best HPC libraries
peakperf
-
lscpu + neofetch = cpufetch
Dr-Noob here. Created an account just to comment on this post. I appreciate all of your comments.
For the ones who think that cpufetch uses lscpu (especially the one who wrote the title of this post), please see https://www.reddit.com/r/linux/comments/milnza/cpufetch_simp...
About the peak performance, nezirus, the purpose is to have a quick look of how powerful a CPU is supposed to be. Peak performance does not measure the real performance of a CPU but it is a rough estimate of it. The peak performance is one of the distinguishing marks of cpufetch and is one of my favorite fields of cpufetch. Concerning the fight between Gold 6238 and EPYC 7702P, is not the other way around. If you are able to use the full power of the CPU, Gold is much more powerful. However, in a real program, this is not always true. For more information about the peak performance, see https://github.com/Dr-Noob/peakperf. There you will understand how peak performance is calculated and how it works.
Thank you very much for your "text screenshots", I really like to see my program on all this variety of hardware!
-
cpufetch - Simplistic yet fancy CPU architecture fetching tool (supports x86_64 and ARM)
If you are interested, you can find more information in another project of mine, peakperf (https://github.com/Dr-Noob/peakperf).
What are some alternatives?
oneDNN - oneAPI Deep Neural Network Library (oneDNN)
XiangShan - Open-source high-performance RISC-V processor
kokkos-kernels - Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
cpufetch - Simple yet fancy CPU architecture fetching tool
nekRS - our next generation fast and scalable CFD code
png2ascii - Lightning fast ASCII image generator
ArrayFire - ArrayFire: a general purpose GPU library.
monolish - monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
x86info - x86info : x86 processor register decoder.
LSQR-CUDA - This is a LSQR-CUDA implementation written by Lawrence Ayers under the supervision of Stefan Guthe of the GRIS institute at the Technische Universität Darmstadt. The LSQR library was authored Chris Paige and Michael Saunders.
duf - Disk Usage/Free Utility - a better 'df' alternative