Top 4 C++ simd-programming Projects

  • highway

    Performance-portable, length-agnostic SIMD with runtime dispatch

    Project mention: JPEG XL and the Pareto Front | news.ycombinator.com | 2024-03-01

    [0] for those interested in Highway.

    It's also mentioned in [1], which starts off

    > Today we're sharing open source code that can sort arrays of numbers about ten times as fast as the C++ std::sort, and outperforms state of the art architecture-specific algorithms, while being portable across all modern CPU architectures. Below we discuss how we achieved this.

    [0] https://github.com/google/highway

    [1] https://opensource.googleblog.com/2022/06/Vectorized%20and%2..., which has an associated paper at https://arxiv.org/pdf/2205.05982.pdf.

  • Vc

    SIMD Vector Classes for C++

  • eve

    Expressive Vector Engine - SIMD in C++ Goes Brrrr (by jfalcou)

    Project mention: Lack of modern scientific libraries written in C | /r/C_Programming | 2023-04-06

    C++ offers tools for writing better APIs, and since the addition of concepts in C++20 it offers much better API enforcement. Writing an equivalent to libraries such as {fmt} or EVE is not possible in anything we’d call C.

  • volk

    The Vector Optimized Library of Kernels (by gnuradio)

    Project mention: RISC-V Business: Testing StarFive's VisionFive 2 SBC | news.ycombinator.com | 2023-03-03

    I wonder how much the performance will improve when compilers get better at RISC-V.

    It's been a long time since I could beat the compiler at optimizing assembly on x86, yet here, merely by unrolling a loop and keeping an eye on write-read stalls, I managed to get a simple "multiply array by const" about 56% faster:


    And that's with hardware that doesn't even have vector instructions! I'd understand GCC not supporting that yet.
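
    The loop-unrolling idea in that comment can be illustrated with a minimal C++ sketch. This is not the commenter's code (the original link is missing, and the work was presumably done in assembly); the function names `scale_simple` and `scale_unrolled` and the unroll factor of 4 are assumptions made here for illustration. The unrolled version issues all four loads before any store, which is one way to keep a store from immediately preceding a dependent load on a simple in-order core.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical baseline: multiply every element by k, one element per iteration.
void scale_simple(float* data, std::size_t n, float k) {
    for (std::size_t i = 0; i < n; ++i) {
        data[i] *= k;
    }
}

// Hypothetical unrolled variant (factor 4 is an assumption, not from the source).
// All four loads are issued before any store, so the four multiplies are
// independent of each other and no load has to wait on a just-written value.
void scale_unrolled(float* data, std::size_t n, float k) {
    assert(data != nullptr || n == 0);
    std::size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        const float a = data[i];
        const float b = data[i + 1];
        const float c = data[i + 2];
        const float d = data[i + 3];
        data[i]     = a * k;
        data[i + 1] = b * k;
        data[i + 2] = c * k;
        data[i + 3] = d * k;
    }
    // Handle the 0 to 3 leftover elements when n is not a multiple of 4.
    for (; i < n; ++i) {
        data[i] *= k;
    }
}
```

    Both functions produce identical results; whether the unrolled form is actually faster depends on the compiler, flags, and target core, so any speedup should be measured rather than assumed.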

    Some other quickstart docs and hot takes from me on this hardware: https://blog.habets.se/2023/01/VisionFive-2-quickstart.html

