umash
UMASH: a fast enough hash and fingerprint with collision bounds (by backtrace-labs)
meow_hash
Official version of the Meow hash, an extremely fast level 1 hash (by cmuratori)
umash | meow_hash | |
---|---|---|
1 | 14 | |
147 | 1,684 | |
0.0% | - | |
0.0 | 0.0 | |
over 1 year ago | almost 2 years ago | |
Python | C++ | |
MIT License | zlib License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
umash
Posts with mentions or reviews of umash.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-10-29.
-
Meow Hash
umash (https://github.com/backtrace-labs/umash) has a similar structure PH block structure, but was designed for decent bit mixing (enough to satisfy smhasher, unlike CLHASH, which needs an additional finalizer) with a lower fixed time cost: 22 cycles for a one-byte hash.
I'm not sure how one would use that linear regression. What kind of hardware offers 675 GB/s of memory bandwidth? 140 bytes/cycle is easily more than twice the L2 read bandwidth offered by any COTS chip I'm aware of. There are also warm up effects past the fixed cost of setup and finalizers that slow down hashing for short input. For what range of input sizes (and hot/cold cache state) would you say the regression is a useful model?
meow_hash
Posts with mentions or reviews of meow_hash.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-10-29.
-
Use fast data algorithms (2021)
I'm late on the reply but I was using xxhash for something similar, but found that Meow hash was faster for me. Feel free to benchmark, since I'm sure it could vary depending on CPU architecture.
- Meow Hash (2018)
- Meow Hash
-
Cryptanalysis of Meow Hash
for everyone who doesn't think highly of Casey Muratori (or at least the way he conducts himself online), the author of Meow Hash, took the criticism quite graciously:
https://twitter.com/cmuratori/status/1417546500083568641
https://github.com/cmuratori/meow_hash/issues/80
- Full 128-bit collision between two files in Meow Hash
-
Reading a file in parallel
Is a standardized hashing algorithm required? SHA256 takes several CPU cycle per byte, but there are several hashing algorithm utilizing AES-NI that gives them 15-16 bytes per cycles, which translates to tens of gigabytes per second on modern CPU. At least one of them has been ported to .NET Core, and from what I see properly calls AES primitives.
-
76% Faster CPython
MeowHash is the fastest non-cryptographic hash available online, and if it were used in CPython, which performs a hash for every fundamental operation in the language, then Python would be much faster.
-
New Bare Hash Map: 2X-3X Speedup over SOTA
Meow hash claims 3-4x faster hashing over this, still passes smhasher, and is a few years old. https://mollyrocket.com/meowhash
-
A Hashmap for the C Programming Language
Have you seen the meow hash implementation? Would that be suitable for this ?