c-blosc
wyhash
c-blosc | wyhash | |
---|---|---|
1 | 9 | |
959 | 917 | |
0.6% | - | |
5.7 | 6.6 | |
about 2 months ago | 3 months ago | |
C | C | |
GNU General Public License v3.0 or later | The Unlicense |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
c-blosc
-
WASM compression benchmarks and the cost of missing compression APIs
Related to compressing data before storing on SSD:
Blosc - faster than memcpy()
https://github.com/Blosc/c-blosc
On right circumstances Blosc is so fast that even speed ups reading data from RAM (read less, decompress in L1 and L2 caches)
wyhash
- Wyhash: The fastest quality hash function
-
What hash function you use for hash maps / hash tables?
I recently switched to wyhash as it seems to have a good combination of speed and stability.
-
Are there any weaker hashes than MD5, but still randomly distributed?
wyhash is a decent option for if you don't need a cryptographical quality hash
-
Hacker News top posts: Mar 15, 2021
New Bare Hash Map: 2X-3X Speedup over SOTA\ (32 comments)
-
New Bare Hash Map: 2X-3X Speedup over SOTA
I feel like you’d want something a bit safer than “we don’t store the keys and just rely on the hash to be really good” [1], putting “please do not use this for serious tasks” in a comment embedded in the header file isn’t a clear enough warning.
It’s not clear to me that that probability of collision assumptions hold. It’s basically assuming that the hashing is perfect and distributes any inputs to the full 64-bit space with uniform probability. That’s the usual hash map / randomized algorithm hope, but does BigCrush or similar avalanche testing really prove that? (Presumably not, otherwise there wouldn’t be image attacks for things like md5).
[1] https://github.com/wangyi-fudan/wyhash/blob/d2a305811972f391...
- wyhash and wyrand are a non-cryptographic 64-bit hash function and PRNG respectively
What are some alternatives?
lizard - Lizard (formerly LZ5) is an efficient compressor with very fast decompression. It achieves compression ratio that is comparable to zip/zlib and zstd/brotli (at low and medium compression levels) at decompression speed of 1000 MB/s and faster.
smhasher - Hash function quality and speed tests
lexbor - Lexbor is development of an open source HTML Renderer library. https://lexbor.com
aHash - aHash is a non-cryptographic hashing algorithm that uses the AES hardware instruction
FPC - FPC - Fast Prefix Coder
meow_hash - Official version of the Meow hash, an extremely fast level 1 hash
cgif - GIF encoder written in C
leocad - A CAD application for creating virtual LEGO models
jdupes - A powerful duplicate file finder and an enhanced fork of 'fdupes'.
smhasher - Automatically exported from code.google.com/p/smhasher
Mersenne-Twister-in-Python - A Mersenne Twister Random Number Generator
countwords - Playing with counting word frequencies (and performance) in various languages.