FastHash
oxide
FastHash | oxide | |
---|---|---|
3 | 9 | |
19 | 276 | |
- | - | |
6.7 | 0.0 | |
6 months ago | over 1 year ago | |
C# | Rust | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
FastHash
-
The Smallest Hash Table
I've implemented your RecSplit method in my MPHF benchmark library (written in C#). The suite is not yet public, but I do want to say thank you for your fantastic method/code/paper. In my own rabbithole research, I stumbled upon several artifacts of your rabbithole trail. Most notably the stuff on StackOverflow, which helped my own research.
I've releaed a set og fast hash functions[1] to help gain an understanding of speed vs. quality. My biggest takeaway is that most generic hash functions can be specialized for integer inputs[2], which often reduce latency by quite a lot, making MPFH more attractive over simple iteration on small sets, as the overhead of hashing is considerably smaller.
[1] https://github.com/Genbox/FastHash
[2] https://github.com/Genbox/FastHash/blob/master/src/FastHash/...
-
The quick and practical “MSI” hash table
It was made (by Sanmayce) to optimize for instruction-level pipelining, and use the fact that modern CPUs have multiple execution ports. But due to those changes, it is not compatible with FNV1a anymore.
The trick of reading in stripes is employed by many of the fastest hashes. It is kinda funny to see how one author prefers a switch case over for loops, where others prefer while loops. The differences can sometimes have a big impact on what optimizations the compiler decides to use.
[1] https://github.com/Genbox/FastHash/blob/master/src/FastHash....
-
Ask HN: What Are You Working On? (August 2022)
At the moment I'm working on FastHash[1], a pet project of mine to port a few high-performance non-cryptographic hash functions to C#.
I'm also trying to build FastLinq, a value-by-reference Language Integrated Query (LINQ) optimized for high-performance scenarios. It is kind of a weird mix as LINQ in .NET is known for its high overhead.
Finally, I'm working on an Office setting synchronization application. I heard a podcast with Paul Thurrott complaining about the lack of sync solutions, so I thought I would do one for fun.
[1] https://github.com/Genbox/FastHash
oxide
-
SQLite Functions for Working with JSON
Sorry about that, it's just a shortcut for https://github.com/fcoury/oxide.
- Ask HN: What are your “scratch own itch” projects?
- Looking for paid advanced Rust tutoring
- OxideDB - Teach your PostgreSQL database how to speak MongoDB Wire Protocol.
- Show HN: OxideDB – Teach PostgreSQL Database How to Speak MongoDB Wire Protocol
-
Ask HN: What Are You Working On? (August 2022)
Mostly on MongoDB to PostgreSQL translation server: http://oxidedb.com or https://demo.oxidedb.com.
I have been wanting to dive deep into a Rust project and the challenge of implementing the MongoDB protocol and then translating it into some sort of SQL counterpart was the first thing that really clicked and got me excited enough to get me working on it nonstop for 3 weeks now.
Some backstory:
I have created a product that relies on MongoDB for a document store but doesn’t really need any of the distributed features to really justify having a hosted MongoDB or DocumentDB instance. Now that we’re trying to turn this into a product, we’re seeing that some companies have a little bit of resistance around managing yet another database. Most of our clients already have and manage PostgreSQL in one form or another. I knew that PostgreSQL already offered first class JSON support, but I didn’t want to rewrite the application data layer from scratch if I could avoid it. That’s when I started researching if there was a “proxy” that would translate the MongoDB protocol - that I was completely ignorant about - into PostgreSQL. To my surprise there was nothing ready for production use but I found MangoDB that later on became FerretDB. I delved into the code and was in love with the idea. The team around is really nice, but I found that they had greater ambitions - they basically wanted to offer multiple backends, namely Tigris, on top of PostgreSQL.
On the other hand, I have been waiting to find an excuse to delve deeply into the rust ecosystem but never really found something I was passionate about until I had the idea of challenging myself to see if I could learn about the protocol that MongoDB uses by relying on their public documentation and the hints I found on FerretDB.
Another thing I added to my toolbelt while developing this was about creating parsers. In order to transform MongoDB JSON to SQL queries, I ported an existing library from the MongoDB team from PEG.js to pest.rs!
It’s in very early stages, and it’s work from someone that is not yet super comfortable with the stack so keep in mind this is the beginning of a journey for me that I embarked out of pure joy on getting a tiny bit better on rust and making things click internally.
- OxideDB – Teach PostgreSQL Database How to Speak MongoDB Wire Protocol
What are some alternatives?
PicoPico - Pico-8 Player
rmkit - | remarkable app framework | https://rmkit.dev
reframe - LeapTable 🦘- The fastest way to build, deploy, and manage LLM-powered agents on tabular data (dataframes, SQL tables and Spreadsheets). [Moved to: https://github.com/peterwnjenga/leaptable]
skeleton - A fully featured UI toolkit for Svelte + Tailwind. [Moved to: https://github.com/skeletonlabs/skeleton]
smhasher - Hash function quality and speed tests
pyroscope-rs - Pyroscope Profiler for Rust. Profile your Rust applications.
needle - A CLI tool that finds a needle (opening/intro and ending/credits) in a haystack (TV or anime episode).
aHash - aHash is a non-cryptographic hashing algorithm that uses the AES hardware instruction
txtai - 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows