x86-simd-sort
C++ header file library for high performance SIMD based sorting algorithms for primitive datatypes (by natmaurice)
Nice. Would you like to add our vqsort to your benchmark? (Note: we haven't yet implemented a workaround specifically for AMD's slow compressstoreu, but we do not use that instruction for 64-bit or 128-bit keys.)
Alright. The benchmark code itself isn't mine; it's Intel's.
As a workaround, I've forked the aforementioned x86-simd-sort repo and added an emulated version of the compress-store for Zen 4. To enable the workaround during compilation, run SW_VCOMPRESS=1 make.
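For readers unfamiliar with the operation being emulated, here is a minimal scalar sketch of what a compress-store computes: the lanes selected by a bitmask are written contiguously to the destination. This is not the fork's code and not an AVX-512 implementation. The function name compress_store_scalar and its signature are hypothetical, chosen only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical scalar model of a compress-store (illustration only, not the
// fork's implementation): copies every lane of `src` whose mask bit is set
// into consecutive slots of `dst` and returns how many lanes were written.
template <typename T>
std::size_t compress_store_scalar(T* dst, std::uint64_t mask,
                                  const T* src, std::size_t lanes) {
    assert(lanes <= 64 && "a 64-bit mask can select at most 64 lanes");
    std::size_t written = 0;
    for (std::size_t i = 0; i < lanes; ++i) {
        if ((mask >> i) & 1u) {        // lane i is selected
            dst[written++] = src[i];   // pack it next to the previous one
        }
    }
    return written;                    // unselected lanes are skipped entirely
}
```

A vectorized emulation would presumably do the packing in a register and then issue an ordinary store, but that is an assumption about the approach, not a description of the fork.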