Top 18 C++ Avx Projects

thorium

103 3,977 9.8 C++

Chromium fork named after radioactive element No. 90. Windows and MacOS/Raspi/Android/Special builds are in different repositories, links are towards the top of the README.md.

Project mention: Supermium – Chromium fork for Win 2003 and newer | news.ycombinator.com | 2024-03-03

highway

66 3,623 9.8 C++

Performance-portable, length-agnostic SIMD with runtime dispatch

Project mention: Llamafile 0.7 Brings AVX-512 Support: 10x Faster Prompt Eval Times for AMD Zen 4 | news.ycombinator.com | 2024-03-31

The bf16 dot instruction replaces 6 instructions: https://github.com/google/highway/blob/master/hwy/ops/x86_12...

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
CTranslate2

13 2,776 9.0 C++

Fast inference engine for Transformer models

Project mention: Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller | news.ycombinator.com | 2023-10-31

Just a point of clarification - faster-whisper references it but ctranslate2[0] is what's really doing the magic here.
Ctranslate2 is a sleeper powerhouse project that enables a lot. They should be up front and center and get the credit they deserve.
[0] - https://github.com/OpenNMT/CTranslate2

xsimd

3 2,036 8.7 C++

C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))

Project mention: GDlog: A GPU-Accelerated Deductive Engine | news.ycombinator.com | 2023-12-03

https://github.com/xtensor-stack/xsimd
GH topics > HashMap:

Simd

1 1,971 9.6 C++

C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM. (by ermig1979)

Project mention: The Case of the Missing SIMD Code | news.ycombinator.com | 2023-06-08

I was curious about these libraries a few weeks ago and did some searching. Is there one that's got a clearly dominating set of users or contributors?
I don't know what a good way to compare these might be, other than perhaps activity/contributor count.
[1] https://github.com/simd-everywhere/simde
[2] https://github.com/ermig1979/Simd
[3] https://github.com/google/highway
[4] https://gitlab.com/libeigen/eigen
[5] https://github.com/shibatch/sleef

kfr

2 1,582 9.2 C++

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
DirectXMath

13 1,481 6.8 C++

DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Vc

6 1,418 6.1 C++

SIMD Vector Classes for C++
eve

6 843 8.8 C++

Expressive Vector Engine - SIMD in C++ Goes Brrrr (by jfalcou)
std-simd

9 544 1.1 C++

std::experimental::simd for GCC [ISO/IEC TS 19570:2018]

Project mention: A proposal for the next version of C [pdf] | news.ycombinator.com | 2024-01-20

neither proposing nor taking a position on this possible addition)
> ... For completeness we would also like to add that a serious issue is that C still lacks vector operations.
Those are good points. The authors don't take a stance on it, but I do think that syntax for packed structs should be standardized. IMO, so should syntax for inline assembly (both as optional features). These are already common extensions; this is exactly what they should standardize. The additions of "typeof" and #embed are also good examples of this (they had been talking about adding #embed since 1995 [1]).
As for vector instructions, I'm unsure how it could be implemented in a standard way, but I'm not against it. Maybe something like this [2], but with the syntax changed for C instead of C++.
[1]: https://groups.google.com/g/comp.std.c/c/zWFEXDvyTwM
[2]: https://github.com/VcDevel/std-simd

MIPP

1 459 8.3 C++

MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX, AVX-512 and SVE (length specific).

Project mention: The Case of the Missing SIMD Code | news.ycombinator.com | 2023-06-08

I've also run into this thinking, and have been looking to solve it in codebases I'm working on.
I've run across: https://github.com/aff3ct/MIPP but have not worked with it extensively yet. It looks to be a solution to the rewriting X parallel pipeline into Y SIMD extensions.
Perhaps something like this, or languages introducing something similar into their standard libraries/modules would be a solution.
None of this of course solves the run-time detection of capability/growing binary size to support such.

hlslpp

0 451 7.7 C++

Math library using hlsl syntax with SSE/NEON support
MandelbrotSSE

4 83 2.0 C++

Real-time Mandelbrot zoom via SSE, AVX, OpenMP, CUDA, XaoS...
peakperf

2 56 4.4 C++

Achieve peak performance on x86 CPUs and NVIDIA GPUs
VectorizedKernel

1 7 2.6 C++

Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures
ThinkingInSimd

1 3 2.3 C++

An essay comparing performance implications of ignoring AVX acceleration

Project mention: Fastest Branchless Binary Search | news.ycombinator.com | 2023-08-11

> In this case std::lower_bound is very slightly but consistently faster than sb_lower_bound. To always get the best performance it is possible for libraries to use sb_lower_bound whenever directly working on primitive types and std::lower_bound otherwise.
I will say that if this is the case, there are probably much better versions of binary search out there for primitive types. I made one just screwing around with SIMD that's 3x faster than std::lower_bound until becoming memory bound:
https://github.com/matthewkolbe/ThinkingInSimd/tree/main/alg...

fpng-java

1 2 9.1 C++

Java Wrapper for the fast, native FPNG Encoder

Project mention: CPNG, a backwards compatible fork of PNG | news.ycombinator.com | 2023-12-09

Someone made this: https://github.com/manticore-projects/fpng-java
Replacing zlib might give you a few percentage points' worth of difference, whilst fpnge would likely be several times faster.

fractals

1 1 10.0 C++

Mandelbrot renderer with SIMD (NEON/AVX) acceleration. (by alademm)
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

C++ Avx related posts

SIMD Everywhere Optimization from ARM Neon to RISC-V Vector Extensions
6 projects | news.ycombinator.com | 29 Sep 2023
The Case of the Missing SIMD Code
7 projects | news.ycombinator.com | 8 Jun 2023
Numerical Computing in C++ Discussion
1 project | /r/cpp | 1 Apr 2023
Similarity Measures on Arm SVE and NEON, x86 AVX2 and AVX-512
2 projects | /r/simd | 25 Mar 2023
SIMD intrinsics and the possibility of a standard library solution
16 projects | /r/cpp | 8 Jan 2023
Optimizing compilers reload vector constants needlessly
7 projects | news.ycombinator.com | 6 Dec 2022
SPO600 project part 2
2 projects | dev.to | 13 Apr 2022
A note from our sponsor - WorkOS
workos.com | 25 Apr 2024

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →

Index

What are some of the best open-source Avx projects in C++? This list will help you:

	Project	Stars
1	thorium	3,977
2	highway	3,623
3	CTranslate2	2,776
4	xsimd	2,036
5	Simd	1,971
6	kfr	1,582
7	DirectXMath	1,481
8	Vc	1,418
9	eve	843
10	std-simd	544
11	MIPP	459
12	hlslpp	451
13	MandelbrotSSE	83
14	peakperf	56
15	VectorizedKernel	7
16	ThinkingInSimd	3
17	fpng-java	2
18	fractals	1