FastHash | minperf | |
---|---|---|
3 | 2 | |
19 | 156 | |
- | - | |
6.7 | 10.0 | |
6 months ago | about 3 years ago | |
C# | Java | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
FastHash
-
The Smallest Hash Table
I've implemented your RecSplit method in my MPHF benchmark library (written in C#). The suite is not yet public, but I do want to say thank you for your fantastic method/code/paper. In my own rabbithole research, I stumbled upon several artifacts of your rabbithole trail. Most notably the stuff on StackOverflow, which helped my own research.
I've releaed a set og fast hash functions[1] to help gain an understanding of speed vs. quality. My biggest takeaway is that most generic hash functions can be specialized for integer inputs[2], which often reduce latency by quite a lot, making MPFH more attractive over simple iteration on small sets, as the overhead of hashing is considerably smaller.
[1] https://github.com/Genbox/FastHash
[2] https://github.com/Genbox/FastHash/blob/master/src/FastHash/...
-
The quick and practical “MSI” hash table
It was made (by Sanmayce) to optimize for instruction-level pipelining, and use the fact that modern CPUs have multiple execution ports. But due to those changes, it is not compatible with FNV1a anymore.
The trick of reading in stripes is employed by many of the fastest hashes. It is kinda funny to see how one author prefers a switch case over for loops, where others prefer while loops. The differences can sometimes have a big impact on what optimizations the compiler decides to use.
[1] https://github.com/Genbox/FastHash/blob/master/src/FastHash....
-
Ask HN: What Are You Working On? (August 2022)
At the moment I'm working on FastHash[1], a pet project of mine to port a few high-performance non-cryptographic hash functions to C#.
I'm also trying to build FastLinq, a value-by-reference Language Integrated Query (LINQ) optimized for high-performance scenarios. It is kind of a weird mix as LINQ in .NET is known for its high overhead.
Finally, I'm working on an Office setting synchronization application. I heard a podcast with Paul Thurrott complaining about the lack of sync solutions, so I thought I would do one for fun.
[1] https://github.com/Genbox/FastHash
minperf
-
The Smallest Hash Table
Yes! That's the implementation from the paper. There's also a Java implementation: https://github.com/thomasmueller/minperf
-
Hash, displace, and compress: Perfect hashing with Java
I understand this is mostly educational. However, there is a Java library for perfect hashing, https://github.com/thomasmueller/minperf. I used it once, it wasn't the fastest thing in the worlds, but it did the job.
What are some alternatives?
PicoPico - Pico-8 Player
Zero-Allocation-Hashing - Zero-allocation hashing for Java
reframe - LeapTable 🦘- The fastest way to build, deploy, and manage LLM-powered agents on tabular data (dataframes, SQL tables and Spreadsheets). [Moved to: https://github.com/peterwnjenga/leaptable]
smhasher - Hash function quality and speed tests
skeleton - A fully featured UI toolkit for Svelte + Tailwind. [Moved to: https://github.com/skeletonlabs/skeleton]
needle - A CLI tool that finds a needle (opening/intro and ending/credits) in a haystack (TV or anime episode).
aHash - aHash is a non-cryptographic hashing algorithm that uses the AES hardware instruction
pyroscope-rs - Pyroscope Profiler for Rust. Profile your Rust applications.
open-recipe-project - Free, and open recipes for anyone to use
.NET Runtime - .NET is a cross-platform runtime for cloud, mobile, desktop, and IoT apps.
oxide - Teach your PostgreSQL database how to speak MongoDB Wire Protocol
referencesource - Source from the Microsoft .NET Reference Source that represent a subset of the .NET Framework