XNNPACK vs HIP-CPU

| | XNNPACK | HIP-CPU |
|---|---|---|
| Mentions | 8 | 5 |
| Stars | 1,700 | 104 |
| Growth | 1.6% | 2.9% |
| Activity | 9.9 | 7.2 |
| Latest commit | 6 days ago | about 1 month ago |
| Language | C | C++ |
| License | GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
XNNPACK
- XNNPACK: High-efficiency floating-point neural network inference operators
- Can an NPU be used for vectors?
- Performance critical ML: How viable is Rust as an alternative to C++?
  Why are you writing your own inference code in C++ or Rust instead of using an established framework like XNNPACK? (A sketch of the framework route follows this list.)
- [P] Pure C/C++ port of OpenAI's Whisper
- [Discussion] Is XNNPACK part of MediaPipe, or does it need to be configured separately?
  XNNPACK - https://github.com/google/XNNPACK
- WebAssembly Techniques to Speed Up Matrix Multiplication by 120x
- Prediction: Macs won't see many new games, no matter how powerful their hardware is
  OK, concrete example time! At work we're going to be using some software that includes XNNPACK, a library of highly optimised operations for neural-network inference. This is the sort of code people have gone in and tuned specifically for performance, and there is no attempt at all to write separate code for Intel vs. AMD, or Apple vs. other ARM vendors. What it targets are elements of the ISA: NEON (i.e. ARM SIMD) on ARM; SSE, AVX, etc. on x86(-64); and Wasm SIMD for Wasm. (See the SIMD dispatch sketch after this list.)
- Where are Nvidia's DLSS models stored and how big are they?
  It's quite simple. https://github.com/google/XNNPACK for example.
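A concrete illustration of the "established framework" route raised in the Rust-vs-C++ thread above: a minimal sketch of one fully-connected layer run through XNNPACK's operator API. Caveat: the C signatures below follow an older XNNPACK release; newer releases add cache parameters and split setup into reshape/setup steps, so check the xnnpack.h you actually build against.

```cpp
// Minimal XNNPACK operator-API sketch (older-release signatures; assumes the
// library and pthreadpool are linked in). Status checks omitted for brevity.
#include <xnnpack.h>
#include <cmath>
#include <cstdio>

int main() {
  xnn_initialize(nullptr);  // nullptr = default allocator

  // One batch of 4 inputs -> 2 outputs; weights are [out_channels x in_channels].
  const float kernel[8] = {1, 0, 0, 0,
                           0, 1, 0, 0};
  const float bias[2]   = {0.5f, -0.5f};
  const float input[4]  = {1, 2, 3, 4};
  float output[2];

  xnn_operator_t fc = nullptr;
  xnn_create_fully_connected_nc_f32(
      /*input_channels=*/4, /*output_channels=*/2,
      /*input_stride=*/4, /*output_stride=*/2,
      kernel, bias,
      /*output_min=*/-INFINITY, /*output_max=*/INFINITY,
      /*flags=*/0, &fc);
  xnn_setup_fully_connected_nc_f32(fc, /*batch_size=*/1, input, output,
                                   /*threadpool=*/nullptr);
  xnn_run_operator(fc, /*threadpool=*/nullptr);

  std::printf("%f %f\n", output[0], output[1]);  // expect 1.5 1.5
  xnn_delete_operator(fc);
  xnn_deinitialize();
}
```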
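The Macs-and-games comment above is about targeting ISA features rather than CPU vendors. The sketch below shows that general compile-time dispatch pattern (AVX vs. NEON vs. a scalar fallback) for a trivial saxpy kernel; the structure and names are my own illustration of the technique, not XNNPACK's actual micro-kernel code.

```cpp
// ISA-feature dispatch, XNNPACK-style in spirit: the same operation gets an
// implementation per SIMD extension, selected by compiler-defined macros.
#include <cstddef>

#if defined(__AVX__)
  #include <immintrin.h>   // x86(-64): SSE/AVX intrinsics
#elif defined(__ARM_NEON)
  #include <arm_neon.h>    // ARM: NEON intrinsics
#endif

// y[i] += a * x[i]
void saxpy(float a, const float* x, float* y, std::size_t n) {
  std::size_t i = 0;
#if defined(__AVX__)
  const __m256 va = _mm256_set1_ps(a);           // 8 lanes per iteration
  for (; i + 8 <= n; i += 8) {
    const __m256 vx = _mm256_loadu_ps(x + i);
    __m256 vy = _mm256_loadu_ps(y + i);
    vy = _mm256_add_ps(_mm256_mul_ps(va, vx), vy);
    _mm256_storeu_ps(y + i, vy);
  }
#elif defined(__ARM_NEON)
  const float32x4_t va = vdupq_n_f32(a);         // 4 lanes per iteration
  for (; i + 4 <= n; i += 4) {
    const float32x4_t vx = vld1q_f32(x + i);
    float32x4_t vy = vld1q_f32(y + i);
    vy = vmlaq_f32(vy, va, vx);                  // vy += va * vx
    vst1q_f32(y + i, vy);
  }
#endif
  for (; i < n; ++i) y[i] += a * x[i];           // scalar tail / portable fallback
}
```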
HIP-CPU
- HIP CPU
- [P] Pure C/C++ port of OpenAI's Whisper
- AMD publishes GPUFORT as Open Source to address CUDA’s dominance
  If I'm reading this right, this is Fortran's equivalent of HIP, i.e. a way to (semi-)automatically convert a CUDA-based solution into a more backend-independent one, so that the same source can run on both CUDA and ROCm GPUs (and potentially more; e.g. there is also an experimental CPU backend). (A single-source sketch of the CUDA/HIP correspondence follows this list.)
- Test Coverage with CUDA
  So, I know you asked about CUDA, but this might actually be possible in HIP, and you can convert your code to HIP relatively easily. The path would be to use the CPU implementation (https://github.com/ROCm-Developer-Tools/HIP-CPU) and then run your code coverage on that. (A minimal build-and-run sketch follows this list.)
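To make the CUDA-to-HIP correspondence from the GPUFORT comment concrete: the two runtime APIs map almost one-to-one, so a thin alias layer lets the same file build with either nvcc or hipcc. The gpu* names below are my own illustration, not GPUFORT output or an official header.

```cpp
// Single-source CUDA/HIP sketch; the alias layer is illustrative only.
#if defined(__HIPCC__)
  #include <hip/hip_runtime.h>
  #define gpuMalloc             hipMalloc
  #define gpuFree               hipFree
  #define gpuMemcpy             hipMemcpy
  #define gpuMemcpyDeviceToHost hipMemcpyDeviceToHost
  #define gpuDeviceSynchronize  hipDeviceSynchronize
#else
  #include <cuda_runtime.h>
  #define gpuMalloc             cudaMalloc
  #define gpuFree               cudaFree
  #define gpuMemcpy             cudaMemcpy
  #define gpuMemcpyDeviceToHost cudaMemcpyDeviceToHost
  #define gpuDeviceSynchronize  cudaDeviceSynchronize
#endif
#include <cstdio>

__global__ void fill(int* out, int v) { out[threadIdx.x] = v; }

int main() {
  int host[32];
  int* dev = nullptr;
  gpuMalloc(&dev, sizeof(host));
  fill<<<1, 32>>>(dev, 7);      // triple-chevron launch works in both dialects
  gpuDeviceSynchronize();
  gpuMemcpy(host, dev, sizeof(host), gpuMemcpyDeviceToHost);
  std::printf("%d\n", host[0]); // expect 7
  gpuFree(dev);
}
```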
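And a minimal sketch of the HIP-CPU coverage path from the "Test Coverage" comment: HIP-CPU is a header-only C++17 implementation of the HIP runtime API (it depends on TBB), so an ordinary host compiler can build unmodified HIP source with instrumentation such as g++'s --coverage. The file name and exact build line are assumptions for illustration.

```cpp
// vadd.cpp -- unmodified HIP code, built for CPU with HIP-CPU, e.g. (assumed):
//   g++ -std=c++17 -I<HIP-CPU>/include --coverage vadd.cpp -ltbb -pthread
#include <hip/hip_runtime.h>
#include <cstdio>
#include <vector>

__global__ void vadd(const float* a, const float* b, float* c, int n) {
  const int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) c[i] = a[i] + b[i];
}

int main() {
  const int n = 1024;
  std::vector<float> a(n, 1.0f), b(n, 2.0f), c(n, 0.0f);

  float *da, *db, *dc;
  hipMalloc(&da, n * sizeof(float));
  hipMalloc(&db, n * sizeof(float));
  hipMalloc(&dc, n * sizeof(float));
  hipMemcpy(da, a.data(), n * sizeof(float), hipMemcpyHostToDevice);
  hipMemcpy(db, b.data(), n * sizeof(float), hipMemcpyHostToDevice);

  // Same launch syntax as on a GPU; HIP-CPU runs it on host threads.
  hipLaunchKernelGGL(vadd, dim3(n / 256), dim3(256), 0, nullptr, da, db, dc, n);
  hipDeviceSynchronize();

  hipMemcpy(c.data(), dc, n * sizeof(float), hipMemcpyDeviceToHost);
  std::printf("c[0] = %f\n", c[0]);  // expect 3.0; coverage data is then gcov's
  hipFree(da); hipFree(db); hipFree(dc);
}
```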
What are some alternatives?
ncnn - High-performance neural network inference framework optimized for the mobile platform
AdaptiveCpp - Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
gemm-benchmark - Simple [sd]gemm benchmark, similar to ACES dgemm
libcudacxx - [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl
cpuid2cpuflags - Tool to generate CPU_FLAGS_* for your CPU
rocFFT - Next generation FFT implementation for ROCm
wasmblr - C++ WebAssembly assembler in a single header file
HIP - HIP: C++ Heterogeneous-Compute Interface for Portability
Genann - Simple neural network library in ANSI C
stdgpu - Efficient STL-like Data Structures on the GPU
ruby-fann - Ruby library for interfacing with FANN (Fast Artificial Neural Network)
AITemplate - Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.