md5-optimisation vs oneDNN

md5-optimisation

The fastest MD5 implementation using x86 assembly (by animetosho)

Md5 Avx512

oneDNN

oneAPI Deep Neural Network Library (oneDNN) (by oneapi-src)

Onednn Oneapi Deep Learning deep-neural-networks Performance CPP Openmp Tbb x86-64 X64 Aarch64 Avx512 amx xe-architecture Library bfloat16 Sycl Vnni

uxlfoundation.org

Project	md5-optimisation	oneDNN
Mentions	2	5
Stars	97	3,474
Growth	-	2.1%
Activity	2.8	10.0
Latest Commit	about 1 year ago	about 12 hours ago
Language	C++	C++
License	-	Apache License 2.0

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

md5-optimisation

Posts with mentions or reviews of md5-optimisation. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-19.

The least interesting part about AVX-512 is the 512 bits vector width
2 projects | news.ycombinator.com | 19 Jun 2023

Very useful. In fact, it speeds up a single instance (i.e. not taking advantage of SIMD) of MD5 by 20%: https://github.com/animetosho/md5-optimisation#x86-avx512-vl...
MD5 Optimisation Tricks: Beating OpenSSL’s Hand-Tuned Assembly
1 project | news.ycombinator.com | 5 Feb 2023

oneDNN

Posts with mentions or reviews of oneDNN. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-17.

Blaze: A High Performance C++ Math library
7 projects | news.ycombinator.com | 17 Apr 2024

If you are talking about non-small matrix multiplication in MKL, it is now open source as part of oneDNN. It has literally the same code as MKL (you can see this by inspecting constants or doing high-precision benchmarks).
For small matmul there is libxsmm. It may take tremendous effort to make something faster than oneDNN and libxsmm, as the JIT-based approach of https://github.com/oneapi-src/oneDNN/blob/main/src/gpu/jit/g... is very flexible: if someone finds a better sequence, oneDNN can reuse it without a major change of design.
But MKL is not limited to matmul, I understand it...
Arc & Deep Learning Frameworks
1 project | /r/intel | 6 Oct 2022

For completeness, it looks like this question was posted to the oneDNN GitHub repo and the response was to stay tuned for updates.
Keeping POWER relevant in the open source world
9 projects | news.ycombinator.com | 22 Jan 2022
Intel oneDNN 2.5 released with experimental RISC-V support
2 projects | /r/RISCV | 9 Dec 2021

From the release note of oneDNN v2.5:
Is gpu hardware tied to cpu ISA ?
1 project | /r/hardware | 11 Jan 2021

Intel are trying to support their oneAPI compute framework on Arm, IBM POWER and z/Architecture (s390x), but since they have only ever released a single discrete GPU with the Xe architecture, it's unclear whether they'll support Xe GPU compute on e.g. Arm: https://github.com/oneapi-src/oneDNN

What are some alternatives?

When comparing md5-optimisation and oneDNN you can also consider the following projects:

kfr - Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

oneMKL - oneAPI Math Kernel Library (oneMKL) Interfaces

xsimd - C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)

CTranslate2 - Fast inference engine for Transformer models

oneDPL - oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html

highway - Highway - A Modern Javascript Transitions Manager

asmjit - Low-latency machine code generation

librealsense - Intel® RealSense™ SDK

Reloaded-II - Next Generation Universal .NET Core Powered Mod Loader compatible with anything X86, X64.

faasm - High-performance stateful serverless runtime based on WebAssembly

peakperf - Achieve peak performance on x86 CPUs and NVIDIA GPUs

openvino - OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
