robin-hood-hashing vs abseil-cpp

Project	robin-hood-hashing	abseil-cpp
Mentions	23	54
Stars	1,465	13,955
Growth	-	1.3%
Activity	0.0	9.5
Latest Commit	about 1 year ago	7 days ago
Language	C++	C++
License	MIT License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

robin-hood-hashing

Posts with mentions or reviews of robin-hood-hashing. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-10.

Factor is faster than Zig
11 projects | news.ycombinator.com | 10 Nov 2023

“In my example the table stores the hash codes themselves instead of the keys (because the hash function is invertible)”
Oh, I see, right. If determining the home bucket is trivial, then the back-shifting method is great. The issue is just that it’s not as much of a general-purpose solution as it may initially seem.
“With a different algorithm (Robin Hood or bidirectional linear probing), the load factor can be kept well over 90% with good performance, as the benchmarks in the same repo demonstrate.”
I’ve seen the 90% claim made several times in literature on Robin Hood hash tables. In my experience, the claim is a bit exaggerated, although I suppose it depends on what our idea of “good performance” is. See these benchmarks, which again go up to a maximum load factor of 0.95 (although Boost and Absl forcibly grow/rehash at 0.85-0.9):
https://strong-starlight-4ea0ed.netlify.app/
Tsl, Martinus, and CC are all Robin Hood tables (https://github.com/Tessil/robin-map, https://github.com/martinus/robin-hood-hashing, and https://github.com/JacksonAllan/CC, respectively). Absl and Boost are the well-known SIMD-based hash tables. Khash (https://github.com/attractivechaos/klib/blob/master/khash.h) is, I think, an ordinary open-addressing table using quadratic probing. Fastmap is a new, yet-to-be-published design that is fundamentally similar to bytell (https://www.youtube.com/watch?v=M2fKMP47slQ) but also incorporates some aspects of the aforementioned SIMD maps (it caches a 4-bit fragment of the hash code to avoid most key comparisons).
As you can see, all the Robin Hood maps spike upwards dramatically as the load factor gets high, becoming as much as 5-6 times slower at 0.95 vs 0.5 in one of the benchmarks (uint64_t key, 256-bit struct value: Total time to erase 1000 existing elements with N elements in map). Only the SIMD maps (with Boost being the better performer) and Fastmap appear mostly immune to load factor in all benchmarks, although the SIMD maps do - I believe - use tombstones for deletion.
I’ve only read briefly about bi-directional linear probing – never experimented with it.
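The back-shifting deletion discussed above can be sketched roughly as follows. This is a minimal illustration and not the code of any library named in the thread: `HashCodeSet` is a hypothetical name, the table stores nonzero 64-bit hash codes directly (0 is assumed to mean "empty"), the home bucket is assumed to be simply the low bits of the code, and plain linear probing is used rather than Robin Hood ordering.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Assumption for this sketch: 0 is never a valid hash code and marks an empty slot.
constexpr std::uint64_t kEmpty = 0;

// Hypothetical fixed-capacity set of hash codes using linear probing.
class HashCodeSet {
public:
    explicit HashCodeSet(std::size_t capacity_pow2)
        : slots_(capacity_pow2, kEmpty), mask_(capacity_pow2 - 1) {
        assert(capacity_pow2 >= 2 && (capacity_pow2 & mask_) == 0);
    }

    // Returns false if the code is already present or the table is full.
    bool insert(std::uint64_t code) {
        assert(code != kEmpty);
        if (size_ + 1 >= slots_.size()) return false;  // always keep one empty slot
        std::size_t i = home(code);
        while (slots_[i] != kEmpty) {
            if (slots_[i] == code) return false;
            i = (i + 1) & mask_;
        }
        slots_[i] = code;
        ++size_;
        return true;
    }

    bool contains(std::uint64_t code) const {
        for (std::size_t i = home(code); slots_[i] != kEmpty; i = (i + 1) & mask_)
            if (slots_[i] == code) return true;
        return false;
    }

    // Erase without tombstones: later entries are shifted back into the hole.
    bool erase(std::uint64_t code) {
        if (code == kEmpty) return false;
        std::size_t hole = home(code);
        while (slots_[hole] != code) {
            if (slots_[hole] == kEmpty) return false;  // not found
            hole = (hole + 1) & mask_;
        }
        for (std::size_t j = (hole + 1) & mask_; slots_[j] != kEmpty; j = (j + 1) & mask_) {
            std::size_t h = home(slots_[j]);
            // Move the entry only if its home bucket is not strictly between
            // the hole and j, so it stays reachable from its home bucket.
            if (((j - h) & mask_) >= ((j - hole) & mask_)) {
                slots_[hole] = slots_[j];
                hole = j;
            }
        }
        slots_[hole] = kEmpty;
        --size_;
        return true;
    }

    std::size_t size() const { return size_; }

private:
    // The home bucket is trivial to recompute because the stored value is the hash code.
    std::size_t home(std::uint64_t code) const {
        return static_cast<std::size_t>(code) & mask_;
    }

    std::vector<std::uint64_t> slots_;
    std::size_t mask_;
    std::size_t size_ = 0;
};
```

The shift loop has to recompute the home bucket of every entry it passes. That is cheap here because the stored value is the hash code, which is the point made above: when the home bucket is expensive to recover from the key, the method is less attractive.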
If this isn't the perfect data structure, why?
3 projects | /r/C_Programming | 22 Oct 2023

From your other comments, it seems like your knowledge of hash tables might be limited to closed-addressing/separate-chaining hash tables. The current frontrunners in high-performance, memory-efficient hash table design all use some form of open addressing, largely to avoid pointer chasing and limit cache misses. In this regard, you want to check out SSE-powered hash tables (such as Abseil, Boost, and Folly/F14), Robin Hood hash tables (such as Martinus and Tessil), or Skarupke (I've recently had a lot of success with a similar design that I will publish here soon and is destined to replace my own Robin Hood hash tables). Also check out existing research/benchmarks here and here. But be a little bit wary of any benchmarks you look at or perform because there are a lot of factors that influence the result (e.g. benchmarking hash tables at a maximum load factor of 0.5 will produce wildly different results to benchmarking them at a load factor of 0.95, just as benchmarking them with integer key-value pairs will produce different results to benchmarking them with 256-byte key-value pairs). And you need to familiarize yourself with open addressing and different probing strategies (e.g. linear, quadratic) first.
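To make the difference between the probing strategies mentioned above concrete, here is a small sketch that lists which slots get visited. The names `Probing` and `probe_sequence` are made up for this illustration, the capacity is assumed to be a power of two, and the quadratic variant uses triangular-number steps (one common choice, not necessarily what any of the named libraries do).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

enum class Probing { Linear, Quadratic };

// Returns the first `count` slot indices visited when starting from `home`.
// Assumes `capacity` is a power of two so that `& mask` acts as modulo.
std::vector<std::size_t> probe_sequence(std::size_t home, std::size_t capacity,
                                        Probing kind, std::size_t count) {
    assert(capacity != 0 && (capacity & (capacity - 1)) == 0);
    const std::size_t mask = capacity - 1;
    std::vector<std::size_t> seq;
    seq.reserve(count);
    std::size_t idx = home & mask;
    for (std::size_t i = 0; i < count; ++i) {
        seq.push_back(idx);
        // Linear: always step by 1. Quadratic: steps of 1, 2, 3, ... so the
        // offsets from home are the triangular numbers 1, 3, 6, 10, ...
        const std::size_t step = (kind == Probing::Linear) ? 1 : i + 1;
        idx = (idx + step) & mask;
    }
    return seq;
}
```

With a capacity of 8 and a home bucket of 3, linear probing visits 3, 4, 5, 6, ... while the quadratic variant visits 3, 4, 6, 1, 5, 2, 0, 7. Linear probing stays within neighbouring cache lines but tends to form clusters; the quadratic sequence spreads out sooner.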
boost::unordered standalone
3 projects | /r/cpp | 9 Jul 2023

Also, FYI there is robin_hood::unordered_{map,set} which has very high performance, and is header-only and standalone.
Solving “Two Sum” in C with a tiny hash table
1 project | news.ycombinator.com | 29 Jun 2023

std::unordered_map is notoriously slow, several times slower than a "proper" hashmap implementation like Google's absl or Martin's robin-hood-hashing [1]. That said, std::sort is not the fastest sort implementation, either. It is hard to say which will win.
[1]: https://github.com/martinus/robin-hood-hashing
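For context, a hash-map-based "Two Sum" might look like the sketch below. It uses `std::unordered_map` as the baseline the comment refers to; the function name `two_sum` and the `{-1, -1}` "not found" convention are assumptions made for this example. The faster maps mentioned above aim to offer a largely similar interface, so swapping the container type is usually the main change, but that should be checked against each library's documentation.

```cpp
#include <cassert>
#include <unordered_map>
#include <utility>
#include <vector>

// Returns the indices of two elements that sum to `target`,
// or {-1, -1} if no such pair exists (a convention chosen for this sketch).
std::pair<int, int> two_sum(const std::vector<int>& nums, int target) {
    std::unordered_map<long long, int> seen;  // value -> index where it was seen
    seen.reserve(nums.size());
    for (int i = 0; i < static_cast<int>(nums.size()); ++i) {
        // Compute in long long so target - nums[i] cannot overflow int.
        const long long need = static_cast<long long>(target) - nums[i];
        const auto it = seen.find(need);
        if (it != seen.end()) return {it->second, i};
        seen.emplace(nums[i], i);
    }
    return {-1, -1};
}
```

Whether this beats a sort-based solution depends on the input size and the map implementation, which is the uncertainty the comment points out.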
Convenient Containers v1.0.3: Better compile speed, faster maps and sets
4 projects | /r/C_Programming | 3 May 2023

The main advantage of the latest version is that it reduces build time by about 53% (GCC 12.1), based on the comprehensive test suite found in unit_tests.c. This improvement is significant because compile time was previously a drawback of this library, with maps and sets—in particular—compiling slower than their C++ template-based counterparts. I achieved it by refactoring the library to do less work inside API macros and, in particular, use fewer _Generic statements, which seem to be a compile-speed bottleneck. A nice side effect of the refactor is that the library can now more easily be extended with the planned dynamic strings and ordered maps and sets. The other major improvement concerns the performance of maps and sets. Here are some interactive benchmarks[1] comparing CC’s maps to two popular implementations of Robin Hood hash maps in C++ (as well as std::unordered_map as a baseline). They show that CC maps perform roughly on par with those implementations.
Effortless Performance Improvements in C++: std::unordered_map
4 projects | news.ycombinator.com | 2 Mar 2023

For anyone in a situation where a set/map (or unordered versions) is in a hot part of the code, I'd also highly recommend Robin Hood: https://github.com/martinus/robin-hood-hashing
It made a huge difference in one of the programs I was running.
Inside boost::unordered_flat_map
11 projects | /r/cpp | 18 Nov 2022
What are some cool modern libraries you enjoy using?
32 projects | /r/cpp | 18 Sep 2022

Oh my bad. Still though -- your name.. it looks very familiar to me. Are you the robin_hood hashing guy perhaps? Yes you are! My bad -- https://github.com/martinus/robin-hood-hashing.
Performance comparison: counting words in Python, C/C++, Awk, Rust, and more
12 projects | news.ycombinator.com | 24 Jul 2022
Got a bit better C++ version here which uses a couple libraries instead of std:: stuff - https://gist.github.com/jcelerier/74dfd473bccec8f1bd5d78be5a... ; boost, fmt and https://github.com/martinus/robin-hood-hashing
```
    $ g++ -I robin-hood-hashing/src/include -O2 -flto -std=c++20 -fno-exceptions -fno-unwind-tables -fno-asynchronous-unwind-tables -lfmt
```
A fast & densely stored hashmap and hashset based on robin-hood backward shift deletion
5 projects | /r/cpp | 4 Jul 2022

The implementation is mostly inspired by this comment and lessons learned from my older robin-hood-hashing hashmap.

abseil-cpp

Posts with mentions or reviews of abseil-cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-27.

Sane C++ Libraries
7 projects | news.ycombinator.com | 27 Jan 2024
Open source collection of Google's C++ libraries
1 project | news.ycombinator.com | 5 Jan 2024
Is Ada safer than Rust?
2 projects | news.ycombinator.com | 2 Dec 2023
Appending to an std::string character-by-character: how does the capacity grow?
2 projects | news.ycombinator.com | 26 Oct 2023

Yeah, it's nice! And Abseil does it, IFF you use LLVM libc++.
https://github.com/abseil/abseil-cpp/blob/master/absl/string...
The standard adopted it as resize_and_overwrite. Which I think is a little clunky.
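As a rough illustration of the standard facility mentioned above, the sketch below fills a string through `std::string::resize_and_overwrite` (C++23) when the standard library advertises it via the `__cpp_lib_string_resize_and_overwrite` feature-test macro, and otherwise falls back to a plain `resize`. The helper name `repeat_char` is invented for this example, and the code says nothing about how Abseil implements its own internal helper.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <string>

// Builds a string of `n` copies of `c`.
std::string repeat_char(char c, std::size_t n) {
    std::string s;
#if defined(__cpp_lib_string_resize_and_overwrite)
    // The callback receives the buffer and its size, writes into it,
    // and returns the final length; no zero-fill happens beforehand.
    s.resize_and_overwrite(n, [c](char* buf, std::size_t len) {
        std::memset(buf, c, len);
        return len;
    });
#else
    // Fallback: resize() value-initializes the new characters first,
    // which is the extra work resize_and_overwrite is meant to avoid.
    s.resize(n);
    std::memset(&s[0], c, n);
#endif
    return s;
}
```

The callback-with-returned-length shape is arguably what makes the interface feel clunky, as the comment says, but it lets the string learn the final size without exposing uninitialized characters.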
Shaving 40% Off Google’s B-Tree Implementation with Go Generics
3 projects | news.ycombinator.com | 19 Sep 2023

This may be confusing to those familiar with Google's libraries. The baseline is the Go BTree, which I personally never heard of until just now, not the C++ absl::btree_set. The benchmarks aren't directly comparable, but the C++ version also comes with good microbenchmark coverage.
https://github.com/google/btree
https://github.com/abseil/abseil-cpp/blob/master/absl/contai...
Faster Sorting Beyond DeepMind’s AlphaDev
1 project | news.ycombinator.com | 19 Sep 2023
“Once” one-time concurrent initialization with an integer
2 projects | news.ycombinator.com | 1 Aug 2023

An implementation of call_once that accommodates callbacks that throw: https://github.com/abseil/abseil-cpp/blob/master/absl/base/c...
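To show what "accommodating callbacks that throw" can mean for an integer-based once flag, here is a simplified sketch. It is not Abseil's implementation: `OnceFlag` and its three states are hypothetical, and waiting threads simply spin with `yield` instead of blocking. If the callback throws, the flag is reset so that a later call can retry.

```cpp
#include <atomic>
#include <cassert>
#include <thread>
#include <utility>

// Hypothetical one-time initialization flag backed by a single atomic int.
class OnceFlag {
public:
    template <class F>
    void call(F&& f) {
        for (;;) {
            int state = state_.load(std::memory_order_acquire);
            if (state == kDone) return;  // already initialized
            if (state == kIdle &&
                state_.compare_exchange_weak(state, kRunning,
                                             std::memory_order_acquire,
                                             std::memory_order_relaxed)) {
                try {
                    std::forward<F>(f)();
                } catch (...) {
                    // Initialization failed: reopen the flag so another call can retry.
                    state_.store(kIdle, std::memory_order_release);
                    throw;
                }
                state_.store(kDone, std::memory_order_release);
                return;
            }
            // Another thread is running the callback (or the CAS failed); wait and re-check.
            std::this_thread::yield();
        }
    }

private:
    enum : int { kIdle = 0, kRunning = 1, kDone = 2 };
    std::atomic<int> state_{kIdle};
};
```

A production version would typically block waiters rather than spin, which this sketch deliberately leaves out.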
[R] AlphaDev discovers faster sorting algorithms
2 projects | /r/MachineLearning | 7 Jun 2023

I wouldn't say it's that cryptic. It's just a few bitwise rotations/shifts/xor operations.
Deepmind Alphadev: Faster sorting algorithms discovered using deep RL
3 projects | news.ycombinator.com | 7 Jun 2023

You can see hashing optimizations as well https://www.deepmind.com/blog/alphadev-discovers-faster-sort..., https://github.com/abseil/abseil-cpp/commit/74eee2aff683cc7d...
I was one of the people who did an expert review of what was done in both sorting and hashing. Overall it's more about assembly, finding missed compiler optimizations and balancing between correctness and distribution (in hashing in particular).
It was not revolutionary in the sense that it hasn't found completely new approaches, but it converged to something incomprehensible for humans yet relatively good for performance, which proves the point that optimal programs are very inhuman.
Note that for instructions in sorting, removing them does not always lead to better performance; for example, instructions can run in parallel and the effect can be less profound. Benchmarks can lie, and the compiler could do something differently when recompiling the sort3 function which was changed. There was some evidence that the effect can come from the other side.
For hashing it was even funnier: very small strings up to 64 bits already used 3 instructions, like add some constant -> multiply 64x64 -> xor upper/lower. For bigger ones the question becomes more complicated, that's why 9-16 was a better spot and it simplified from 2 multiplications to just one and a rotation. Distribution on real workloads was good, it almost passed smhasher and we decided it was good enough to try out in prod. We did not roll back, as you can see from Abseil :)
But even given all that, it was fascinating to watch how this system was searching and was able to find that particular programs can be further simplified. Kudos to everyone involved, it's a great incremental change that can bring more results in the future.
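The three-step mix described above for very small inputs (add a constant, do a 64x64-bit multiply, xor the upper and lower halves) can be sketched as follows. The constants are arbitrary placeholders chosen for this example and are not Abseil's, and the names `U128`, `mul64x64`, `fold_multiply` and `mix_small` are made up here. The 128-bit product is computed portably from 32-bit halves; on many compilers a native 128-bit type would do the same job in one multiply.

```cpp
#include <cassert>
#include <cstdint>

struct U128 {
    std::uint64_t hi;
    std::uint64_t lo;
};

// Full 64x64 -> 128-bit multiplication built from 32-bit partial products.
inline U128 mul64x64(std::uint64_t a, std::uint64_t b) {
    const std::uint64_t a_lo = a & 0xFFFFFFFFull, a_hi = a >> 32;
    const std::uint64_t b_lo = b & 0xFFFFFFFFull, b_hi = b >> 32;
    const std::uint64_t p0 = a_lo * b_lo;
    const std::uint64_t p1 = a_lo * b_hi;
    const std::uint64_t p2 = a_hi * b_lo;
    const std::uint64_t p3 = a_hi * b_hi;
    // Sum of the middle 32-bit columns, including the carry out of p0.
    const std::uint64_t mid = (p0 >> 32) + (p1 & 0xFFFFFFFFull) + (p2 & 0xFFFFFFFFull);
    U128 r;
    r.lo = (mid << 32) | (p0 & 0xFFFFFFFFull);
    r.hi = p3 + (p1 >> 32) + (p2 >> 32) + (mid >> 32);
    return r;
}

// Multiply, then xor the upper and lower 64 bits of the product.
inline std::uint64_t fold_multiply(std::uint64_t a, std::uint64_t b) {
    const U128 p = mul64x64(a, b);
    return p.hi ^ p.lo;
}

// Add constant -> multiply 64x64 -> xor upper/lower.
inline std::uint64_t mix_small(std::uint64_t x) {
    const std::uint64_t kAdd = 0x9E3779B97F4A7C15ull;  // placeholder constant
    const std::uint64_t kMul = 0xD6E8FEB86659FD93ull;  // placeholder odd multiplier
    return fold_multiply(x + kAdd, kMul);
}
```

Folding the high half back into the low half is what lets a single multiplication spread input bits across the whole 64-bit result.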
Backward compatible implementations of newer standards constructs?
5 projects | /r/cpp_questions | 24 May 2023

Check out https://abseil.io. It offers absl::optional, which is a backport of std::optional.

What are some alternatives?

When comparing robin-hood-hashing and abseil-cpp you can also consider the following projects:

parallel-hashmap - A family of header-only, very fast and memory-friendly hashmap and btree containers.

Folly - An open-source C++ library developed and used at Facebook.

STL - MSVC's implementation of the C++ Standard Library.

Boost - Super-project for modularized Boost

robin-map - C++ implementation of a fast hash map and hash set using robin hood hashing

spdlog - Fast C++ logging library.

xxHash - Extremely fast non-cryptographic hash algorithm

Qt - Qt Base (Core, Gui, Widgets, Network, ...)

C++ Format - A modern formatting library

EASTL - Obsolete repo, please go to: https://github.com/electronicarts/EASTL

tracy - Frame profiler

BDE - Basic Development Environment - a set of foundational C++ libraries used at Bloomberg.
