SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Avx Open-Source Projects
-
thorium
Chromium fork named after radioactive element No. 90. Windows and MacOS/Raspi/Android/Special builds are in different repositories, links are towards the top of the README.md.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
-
Simd
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM. (by ermig1979)
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
-
DirectXMath
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
-
Thorium-Win
Chromium fork for Windows named after radioactive element No. 90; Windows builds of https://github.com/Alex313031/Thorium
-
sha256-simd
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
-
Mercury
Firefox fork with compiler optimizations and patches from Librewolf, Waterfox, and GNU IceCat. (by Alex313031)
-
MIPP
MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX, AVX-512 and SVE (length specific).
-
Thorium-Win-AVX2
Repo to serve AVX2 Windows builds of Thorium. https://github.com/Alex313031/Thorium/
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Supermium – Chromium fork for Win 2003 and newer | news.ycombinator.com | 2024-03-03
Project mention: Llamafile 0.7 Brings AVX-512 Support: 10x Faster Prompt Eval Times for AMD Zen 4 | news.ycombinator.com | 2024-03-31The bf16 dot instruction replaces 6 instructions: https://github.com/google/highway/blob/master/hwy/ops/x86_12...
Project mention: Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller | news.ycombinator.com | 2023-10-31Just a point of clarification - faster-whisper references it but ctranslate2[0] is what's really doing the magic here.
Ctranslate2 is a sleeper powerhouse project that enables a lot. They should be up front and center and get the credit they deserve.
[0] - https://github.com/OpenNMT/CTranslate2
I was curious about these libraries a few weeks ago and did some searching. Is there one that's got a clearly dominating set of users or contributors?
I don't know what a good way to compare these might be, other than perhaps activity/contributor count.
[1] https://github.com/simd-everywhere/simde
[2] https://github.com/ermig1979/Simd
[3] https://github.com/google/highway
[4] https://gitlab.com/libeigen/eigen
[5] https://github.com/shibatch/sleef
https://github.com/xtensor-stack/xsimd
GH topics > HashMap:
I was curious about these libraries a few weeks ago and did some searching. Is there one that's got a clearly dominating set of users or contributors?
I don't know what a good way to compare these might be, other than perhaps activity/contributor count.
[1] https://github.com/simd-everywhere/simde
[2] https://github.com/ermig1979/Simd
[3] https://github.com/google/highway
[4] https://gitlab.com/libeigen/eigen
[5] https://github.com/shibatch/sleef
Firefox and then Brave if I need a Chromium based browser, although I think I am going to ditch Brave for Thorium. Maybe even Edge, it's a lot better now.
BLAKE3 is faster than hardware accelerated SHA-2 because the tree mode used in BLAKE3 allows hashing parts of a single message in parallel (with SHA-2, parts of a single message have to be hashed one after another, and parallelism is only used in workloads where you process multiple messages at the same time).
https://github.com/minio/sha256-simd
https://github.com/BLAKE3-team/BLAKE3
Project mention: Mercury – Compiler optimized, private Firefox fork | news.ycombinator.com | 2024-03-04
I'm the main author of Highway, so I have some opinions :D Number of operations/platforms supported are important criteria.
A hopefully unbiased commentary:
Simde allows you to take existing nonportable intrinsics and get them to run on another platform. This is useful when you have a bunch of existing code and tight deadlines. The downside is less than optimal performance - a portable abstraction can be more efficient than forcing one platform to exactly match the semantics of another. Although a ton of effort has gone into Simde, sometimes it also resorts to autovectorization which may or may not work.
Eigen and SLEEF are mostly math-focused projects that also have a portability layer. SLEEF is designed for C and thus has type suffixes which are rather verbose, see https://github.com/shibatch/sleef/blob/master/src/libm/sleef... But it offers a complete (more so than Highway's) libm.
neither proposing nor taking a position on this possible addition)
> ... For completeness we would also like to add that a serious issue is that C still lacks vector operations.
Those are good points. The authors don't take a stance on it, but I do think that syntax for packed structs should be standardized. IMO, so should syntax for inline assembly (both as optional features). These are already common extensions; this is exactly what they should standardize. The additions of "typeof" and #embed are also good examples of this (they had been talking about adding #embed since 1995 [1]).
As for vector instructions, I'm unsure how it could be implemented in a standard way, but I'm not against it. Maybe something like this [2], but with the syntax changed for C instead of C++.
[1]: https://groups.google.com/g/comp.std.c/c/zWFEXDvyTwM
[2]: https://github.com/VcDevel/std-simd
I've also run into this thinking, and have been looking to solve it in codebases I'm working on.
I've run across: https://github.com/aff3ct/MIPP but have not worked with it extensively yet. It looks to be a solution to the rewriting X parallel pipeline into Y SIMD extensions.
Perhaps something like this, or languages introducing something similar into their standard libraries/modules would be a solution.
None of this of course solves the run-time detection of capability/growing binary size to support such.
FYI a number of streaming sites won't work - while this has Widevine, it does not have Verified Media Path (VMP) which verifies that you're running a signed binary. https://github.com/Alex313031/Thorium-Win-AVX2/issues/84#iss...
https://github.com/castlabs/electron-releases is an interesting Electron fork with full Widevine+VMP support - but it's very much closed-source.
Project mention: Show HN: The fastest Turbo-Base64 now for Python | news.ycombinator.com | 2023-08-24** Cython bindings for Turbo Base64 [1] **
- 20-30x faster than the standard library
- Benchmarks faster than any other C base64 library
- Fastest implementation of AVX, AVX2, and AVX512 base64 encoding
- No other dependencies
[1] - https://github.com/powturbo/Turbo-Base64
Avx related posts
- The Curious Case of MD5
- Beating GCC 12 - 118x Speedup for Jensen Shannon Divergence via AVX-512FP16
- Show HN: Beating GCC 12 – 118x Speedup for Jensen Shannon D. Via AVX-512FP16
- SimSIMD v2: Vector Similarity Functions 3x-200x Faster than SciPy and NumPy
- SimSIMD v2: 3-200x Faster Vector Similarity Functions than SciPy and NumPy
- Comparing Vectors 3-200x Faster than SciPy and NumPy
- SIMD Everywhere Optimization from ARM Neon to RISC-V Vector Extensions
-
A note from our sponsor - SaaSHub
www.saashub.com | 26 Apr 2024
Index
What are some of the best open-source Avx projects? This list will help you:
Project | Stars | |
---|---|---|
1 | thorium | 3,977 |
2 | highway | 3,645 |
3 | CTranslate2 | 2,776 |
4 | simde | 2,167 |
5 | cglm | 2,043 |
6 | xsimd | 2,036 |
7 | Simd | 1,974 |
8 | kfr | 1,587 |
9 | DirectXMath | 1,481 |
10 | Vc | 1,418 |
11 | Thorium-Win | 1,156 |
12 | sha256-simd | 933 |
13 | Mercury | 932 |
14 | eve | 843 |
15 | sleef | 586 |
16 | std-simd | 544 |
17 | MIPP | 459 |
18 | hlslpp | 451 |
19 | Thorium-Win-AVX2 | 350 |
20 | nsimd | 315 |
21 | Turbo-Base64 | 253 |
22 | Hybridizer | 230 |
23 | Thorium-Special | 180 |
Sponsored