vxsort-cpp
version2
| | vxsort-cpp | version2 |
|---|---|---|
| Mentions | 2 | 6 |
| Stars | 30 | 1,227 |
| Growth | - | 1.9% |
| Activity | 5.9 | 5.8 |
| Last commit | 6 months ago | 3 months ago |
| Language | C++ | C++ |
| License | MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vxsort-cpp
- Significantly faster quicksort using SIMD
Interesting post and paper, thanks!
Sometimes the state of the art is not found in another paper but somewhere else, e.g., there is vxsort by Dan Shechter (damageboy):
https://github.com/damageboy/vxsort-cpp/tree/master
He uses a similar approach, and while I'm not sure how it compares to the Blacher et al. version, I expect it to be in the ballpark.
- Discussion Thread
version2
- SIMD intrinsics and the possibility of a standard library solution
Vector class library - 938 GH stars
- Checking for the absence of a string, naive AVX-512 edition
- -🎄- 2022 Day 4 Solutions -🎄-
Most of the time is spent parsing, but this problem lends itself nicely to a SIMD formulation, which, using vectorclass, doesn't even require detailed knowledge of the intrinsics. Hot runs take ~14 µs on a Core i9-12900K, including I/O. Full code is [here](https://github.com/ahans/aoc2022/blob/main/cpp/day04.cc); the interesting part is this, where we process 32 elements at once:
- Significantly faster quicksort using SIMD
- Parsing JSON faster with Intel AVX-512
- What do you think is faster for batch-processing a lot of "double-type" arithmetic?
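
The Day 4 excerpt above is cut off before its code, and the linked solution is not reproduced here. As a rough illustration of why that puzzle suits SIMD, the sketch below counts the pairs in which one range fully contains the other (part 1 of Advent of Code 2022 Day 4) with a branch-free loop in plain C++. It does not use vectorclass. `RangePairs` and `count_fully_contained` are hypothetical names, and the structure-of-arrays layout and 8-bit element type are assumptions, chosen so that a 256-bit register would hold 32 values at once.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical structure-of-arrays layout (an assumption, not the linked
// solution's layout): element i of each array belongs to input line i,
// which has the form "a_lo-a_hi,b_lo-b_hi".
struct RangePairs {
    std::vector<std::uint8_t> a_lo, a_hi, b_lo, b_hi;
};

// Counts the pairs in which one range fully contains the other.
// The loop body has no branches (bitwise & and | instead of && and ||),
// which leaves the compiler free to auto-vectorize it.
std::size_t count_fully_contained(const RangePairs& p) {
    const std::size_t n = p.a_lo.size();
    // All four arrays must describe the same number of pairs.
    assert(p.a_hi.size() == n && p.b_lo.size() == n && p.b_hi.size() == n);

    std::size_t count = 0;
    for (std::size_t i = 0; i < n; ++i) {
        const bool a_in_b = (p.b_lo[i] <= p.a_lo[i]) & (p.a_hi[i] <= p.b_hi[i]);
        const bool b_in_a = (p.a_lo[i] <= p.b_lo[i]) & (p.b_hi[i] <= p.a_hi[i]);
        count += static_cast<std::size_t>(a_in_b | b_in_a);
    }
    return count;
}
```

Filling the four arrays with the six sample pairs from the puzzle statement (2-4,6-8 through 2-6,4-8) gives a count of 2. Whether the loop is actually vectorized depends on the compiler and flags; an explicit SIMD library such as vectorclass removes that uncertainty at the cost of a dependency.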
What are some alternatives?
vops
highway - Performance-portable, length-agnostic SIMD with runtime dispatch
aoc22 - Advent of Code solutions for 2022 (in Python)
advent2022
adventOfCode2022
simde - Implementations of SIMD instruction sets for systems which don't natively support them.
Day4 - My (messy) Python3 solution for day4's puzzle.
advent-of-code
adventOfCode2022 - For tracking my advent of code participation 2022
advent-of-code-2022-rust - My Rust advent of code 2022 solutions
simd_decimal - vectorized decimal parsing
aoc22 - aoc22