Intel Publishes Blazing Fast AVX-512 Sorting Library, Numpy Switching To It For 10~17x Faster Sorts

This page summarizes the projects mentioned and recommended in the original post on /r/programming

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • highway

    Performance-portable, length-agnostic SIMD with runtime dispatch

  • Nice. Would you like to add our vqsort to your benchmark? (Note: we haven't yet implemented a workaround specifically for AMD's compressstoreu, but do not use it for 64 nor 128-bit keys.)

  • x86-simd-sort

    C++ template library for high performance SIMD based sorting algorithms

  • Alright. The benchmark code itself isn't mine, it's Intel's.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • x86-simd-sort

    C++ header file library for high performance SIMD based sorting algorithms for primitive datatypes (by natmaurice)

  • For a workaround, I've forked the aforementioned x86-simd-sort repo with the emulated version for Zen 4. To enable the workaround during compilation, run SW_VCOMPRESS=1 make.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts