toys vs sse-popcount

Metric | toys | sse-popcount
---|---|---
Mentions | 2 | 2
Stars | 311 | 312
Growth | - | -
Activity | 5.2 | 5.6
Last commit | 16 days ago | about 1 month ago
Language | C++ | C++
License | BSD 2-clause "Simplified" License | BSD 2-clause "Simplified" License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
toys
- Modern Perfect Hashing for Strings
  I think all of these techniques check whether the input string is correct. For example, see here: https://github.com/WojciechMula/toys/blob/master/lookup-in-s...
- Good books or resources for x64 SIMD instructions? (SSE, AVX)
sse-popcount
- Fast bitset decoding using Intel AVX-512
  https://developer.arm.com/documentation/ddi0596/2020-12/SIMD...
  I believe it does 128 bits per instruction, but I'm still struggling with Rust w/ asm.
  Along my journeys, however, I found this repo, https://github.com/WojciechMula/sse-popcount/, which has tons of competing SIMD implementations for both Intel and ARM.
- Counting set bits in an interesting way
  The builtin POPCNT that came with Intel's SSE4 (SSE4a for AMD) is much faster. However, at a certain point, using AVX2 (and AVX-512 if present) is actually faster yet [1], at least for inputs of 512 bytes or larger.
  [1]: https://github.com/WojciechMula/sse-popcount
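
For context on what these comments compare, here is a minimal sketch of a portable scalar bit count over a byte buffer, the kind of baseline that hardware POPCNT and SIMD variants are usually measured against. It is not code from the sse-popcount repository. The function names (`popcount_naive`, `popcount_swar`, `count_bits`) are hypothetical and chosen only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Reference version: clears the lowest set bit until the word is zero.
inline unsigned popcount_naive(std::uint64_t x) {
    unsigned n = 0;
    while (x) {
        x &= x - 1;  // drop the lowest set bit
        ++n;
    }
    return n;
}

// Branch-free SWAR version: sums bits in 2-, 4-, then 8-bit fields,
// then adds the eight byte sums with a single multiplication.
inline unsigned popcount_swar(std::uint64_t x) {
    x = x - ((x >> 1) & 0x5555555555555555ULL);
    x = (x & 0x3333333333333333ULL) + ((x >> 2) & 0x3333333333333333ULL);
    x = (x + (x >> 4)) & 0x0F0F0F0F0F0F0F0FULL;
    return static_cast<unsigned>((x * 0x0101010101010101ULL) >> 56);
}

// Counts set bits in a buffer, eight bytes at a time, then the tail bytes.
inline std::uint64_t count_bits(const std::uint8_t* data, std::size_t size) {
    assert(data != nullptr || size == 0);
    std::uint64_t total = 0;
    std::size_t i = 0;
    for (; i + 8 <= size; i += 8) {
        std::uint64_t word;
        std::memcpy(&word, data + i, sizeof word);  // avoids unaligned reads
        total += popcount_swar(word);
    }
    for (; i < size; ++i) {
        total += popcount_swar(data[i]);
    }
    return total;
}
```

Swapping `popcount_swar` for a hardware instruction or a vectorized kernel changes only the inner loop, which is why buffer size matters so much in the comparison quoted above.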
What are some alternatives?
exiting - Safely shutdown http://hapijs.com servers.
libsimdpp - Portable header-only C++ low level SIMD library
boilerplate - A friendly, proven starting place for your next hapi plugin or deployment
highway - Performance-portable, length-agnostic SIMD with runtime dispatch
hapipal-realworld-example-app - Exemplary real world application built with hapi pal
Vc - SIMD Vector Classes for C++
underdog - HTTP/2 server-push for hapi
simdjson - Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
hpal - hapi pal CLI
oneDNN - oneAPI Deep Neural Network Library (oneDNN)
Simd - C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX (Altivec) and VSX (Power7) for PowerPC, NEON for ARM.