hashtable-benchmarks vs unordered_dense

hashtable-benchmarks

An Evaluation of Linear Probing Hashtable Algorithms (by senderista)

Suggest topics

Source Code

senderista.github.io

Suggest alternative

Edit details

unordered_dense

A fast & densely stored hashmap and hashset based on robin-hood backward shift deletion (by martinus)

C++ Cpp17 CPP Hash hash-tables header-only-library no-dependencies stl-containers unordered-map unordered-set

Source Code

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

hashtable-benchmarks		unordered_dense
	Project
8	Mentions	12
29	Stars	730
-	Growth	-
4.7	Activity	7.1
5 months ago	Latest Commit	22 days ago
Java	Language	C++
Apache License 2.0	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

hashtable-benchmarks

Posts with mentions or reviews of hashtable-benchmarks. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-20.

Building a faster hash table for high performance SQL joins
3 projects | news.ycombinator.com | 20 Dec 2023

Since the blog post mentioned a PR to replace linear probing with Robin Hood, I just wanted to mention that I found bidirectional linear probing to outperform Robin Hood across the board in my Java integer set benchmarks:
https://github.com/senderista/hashtable-benchmarks/blob/mast...
https://github.com/senderista/hashtable-benchmarks/wiki/64-b...
Ask HN: Who wants to be hired? (December 2023)
26 projects | news.ycombinator.com | 1 Dec 2023

https://homes.cs.washington.edu/~magda/papers/wang-cidr17.pd...
I'm most interested in developing high-performance database engines in low-level languages, but open to any challenging systems programming project. I've been working in C++ for the last 3 years, but have written nontrivial projects in Rust and Java as well (e.g., https://github.com/senderista/rotated-array-set, https://github.com/senderista/hashtable-benchmarks). I would enjoy using Rust or Zig on a new project, but I consider the project itself to be much more important than the language it's written in. I am not interested in cryptocurrency, adtech, or fintech projects.
Factor is faster than Zig
11 projects | news.ycombinator.com | 10 Nov 2023

Thanks for the details on your benchmarks. I would like sometime to extend BLP to a more generic setting; as I said I think any trick used with RH would also work with BLP. I just used an integer set because that's all I needed for my use case and it was easy to implement several different approaches for benchmarking. As you note, it favors use cases where the hash function is cheap (or invertible) and elements are cheap to move around.
About your question on load factors: no, the benchmarks are measuring exactly what they claim to be. The hash table constructor divides max data size by load factor to get the table size (https://github.com/senderista/hashtable-benchmarks/blob/mast...), and the benchmark code instantiates each hash table for exactly the measured data set size and load factor (https://github.com/senderista/hashtable-benchmarks/blob/mast...).
I can't explain the peaks around 1M in many of the plots; I didn't investigate them at the time and I don't have time now. It could be a JVM artifact, but I did try to use JMH "best practices", and there's no dynamic memory allocation or GC happening during the benchmark at all. It would be interesting to port these tables to Rust and repeat the measurements with Criterion. For more informative graphs I might try a log-linear approach: divide the intervals between the logarithmically spaced data sizes into a fixed number of subintervals (say 4).
Inside boost::unordered_flat_map
11 projects | /r/cpp | 18 Nov 2022

I think "bidirectional linear probing" is an underrated approach (and much simpler): https://github.com/senderista/hashtable-benchmarks/blob/master/src/main/java/set/int64/BLPLongHashSet.java
A fast & densely stored hashmap and hashset based on robin-hood backward shift deletion
5 projects | /r/cpp | 4 Jul 2022

I will probably never get around to porting my bidirectional linear probing integer hash set from Java to C++, but I hope someone can try adapting BLP to general C++ hashmaps and hashsets, because it significantly outperforms Robin Hood in my benchmarks.
Ask HN: Who wants to be hired? (March 2022)
14 projects | news.ycombinator.com | 1 Mar 2022

https://homes.cs.washington.edu/~magda/papers/wang-cidr17.pd...
I'm most interested in developing high-performance database engines in low-level languages, but open to any challenging systems programming project. I've been working in C++ for the last 2 years, but have written nontrivial projects in Rust and Java as well (e.g., https://github.com/senderista/rotated-array-set, https://github.com/senderista/hashtable-benchmarks). I would enjoy using Rust or Zig on a new project, but I consider the project itself to be much more important than the language it's written in. I am not interested in cryptocurrency, adtech, or fintech projects.

unordered_dense

Posts with mentions or reviews of unordered_dense. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-07-09.

unordered_dense: A Fast & Densely Stored Hashmap And Hashset Based On Robin-Hood Backward Shift Deletion
1 project | /r/programming | 11 Jul 2023
unordered_dense: A fast, densely stored hashmap based on backward shift deletion
1 project | news.ycombinator.com | 11 Jul 2023
boost::unordered standalone
3 projects | /r/cpp | 9 Jul 2023

That's deprecated. Use https://github.com/martinus/unordered_dense instead And yes, tell use if it's any better(it should)
Is there an accepted way to order qualifiers?
2 projects | /r/cpp | 23 Apr 2023

No, I've deprecated it because the code has become a mess, I rewrote it quite differently and with much higher code quality and more features here: https://github.com/martinus/unordered_dense
Effortless Performance Improvements in C++: std::unordered_map
3 projects | /r/cpp | 2 Mar 2023

When no ordering is necessary and the number of elements is larger than 20, nothing beats https://github.com/martinus/unordered_dense (for general use).
Effortless Performance Improvements in C++: std:unordered_map
4 projects | news.ycombinator.com | 2 Mar 2023

https://github.com/martinus/unordered_dense
Check this one out, it's a successor to this idea. Boost also introduced a very performant flat_hash_map in 1.81
Fuzzing is Cool, Actually
1 project | /r/cpp | 14 Jan 2023

I have an API fuzz test for a hash map here: https://github.com/martinus/unordered_dense/blob/main/test/fuzz/api.cpp
A container with set interface based on std::vector
3 projects | /r/cpp | 22 Dec 2022

That sounds a bit like ankerl::unordered_dense::set: https://github.com/martinus/unordered_dense
Inside boost::unordered_flat_map
11 projects | /r/cpp | 18 Nov 2022
martinus/unordered_dense v1.4.0: A fast & densely stored hashmap, Now with heterogeneous overloads
2 projects | /r/cpp | 2 Oct 2022

What are some alternatives?

When comparing hashtable-benchmarks and unordered_dense you can also consider the following projects:

myria - Myria is a scalable Analytics-as-a-Service platform based on relational algebra.

robin-map - C++ implementation of a fast hash map and hash set using robin hood hashing

js2scheme

STC - A modern, user friendly, generic, type-safe and fast C99 container library: String, Vector, Sorted and Unordered Map and Set, Deque, Forward List, Smart Pointers, Bitset and Random numbers.

flat_hash_map - A very fast hashtable

robin-hood-hashing - Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20

unordered - Boost.org unordered module

nafeez.xyz - ⚡ My personal website.

Personal-Site-Gourav.io - My personal site & blog made with NextJS, Typescript, Tailwind CSS, MDX, Notion as CMS. Deployed on Vercel : https://gourav.io

parallel-hashmap - A family of header-only, very fast and memory-friendly hashmap and btree containers.