oneDNN
peakperf
Our great sponsors
oneDNN | peakperf | |
---|---|---|
5 | 2 | |
3,446 | 56 | |
2.2% | - | |
10.0 | 4.4 | |
4 days ago | 2 months ago | |
C++ | C++ | |
Apache License 2.0 | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
oneDNN
-
Blaze: A High Performance C++ Math library
If you are talking about non-small matrix multiplication in MKL, is now in opensource as a part of oneDNN. It literally has exactly the same code, as in MKL (you can see this by inspecting constants or doing high-precision benchmarks).
For small matmul there is libxsmm. It may take tremendous efforts make something faster than oneDNN and libxsmm, as jit-based approach of https://github.com/oneapi-src/oneDNN/blob/main/src/gpu/jit/g... is too flexible: if someone finds a better sequence, oneDNN can reuse it without major change of design.
But MKL is not limited to matmul, I understand it...
-
Arc & Deep Learning Frameworks
For completeness, it looks like this question was posted to the oneDNN GitHub repo and the response was to stay tune for updates.
- Keeping POWER relevant in the open source world
-
Intel oneDNN 2.5 released with experimental RISC-V support
From the release note of oneDNN v2.5:
-
Is gpu hardware tied to cpu ISA ?
Intel are trying to support their oneAPI compute framework on Arm and IBM POWER and z/Architecture (s390x) but since they ever released only a single discrete GPU with the Xe architecture it's unclear whether they'll support Xe GPU compute on e.g. ARM https://github.com/oneapi-src/oneDNN
peakperf
-
lscpu + neofetch = cpufetch
Dr-Noob here. Created an account just to comment on this post. I appreciate all of your comments.
For the ones who think that cpufetch uses lscpu (especially the one who wrote the title of this post), please see https://www.reddit.com/r/linux/comments/milnza/cpufetch_simp...
About the peak performance, nezirus, the purpose is to have a quick look of how powerful a CPU is supposed to be. Peak performance does not measure the real performance of a CPU but it is a rough estimate of it. The peak performance is one of the distinguishing marks of cpufetch and is one of my favorite fields of cpufetch. Concerning the fight between Gold 6238 and EPYC 7702P, is not the other way around. If you are able to use the full power of the CPU, Gold is much more powerful. However, in a real program, this is not always true. For more information about the peak performance, see https://github.com/Dr-Noob/peakperf. There you will understand how peak performance is calculated and how it works.
Thank you very much for your "text screenshots", I really like to see my program on all this variety of hardware!
-
cpufetch - Simplistic yet fancy CPU architecture fetching tool (supports x86_64 and ARM)
If you are interested, you can find more information in another project of mine, peakperf (https://github.com/Dr-Noob/peakperf).
What are some alternatives?
oneMKL - oneAPI Math Kernel Library (oneMKL) Interfaces
XiangShan - Open-source high-performance RISC-V processor
CTranslate2 - Fast inference engine for Transformer models
oneDPL - oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html
cpufetch - Simple yet fancy CPU architecture fetching tool
highway - Highway - A Modern Javascript Transitions Manager
png2ascii - Lightning fast ASCII image generator
asmjit - Low-latency machine code generation
x86info - x86info : x86 processor register decoder.
librealsense - IntelĀ® RealSenseā¢ SDK
duf - Disk Usage/Free Utility - a better 'df' alternative