tantivy
ht
Our great sponsors
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tantivy
-
SeekStorm VS tantivy - a user suggested alternative
2 projects | 22 Mar 2024
-
What is Hybrid Search?
Tantivy - a full-text indexing library written in Rust. Has a great performance and featureset.
- Tantivy – Fast, OSS full-text search library in Rust
-
RAG Using Unstructured Data and Role of Knowledge Graphs
By this I presume you mean build a search index that can retrieve results based on keywords? I know certain databases use Lucene to build a keyword-based index on top of unstructured blobs of data. Another alternative is to use Tantivy (https://github.com/quickwit-oss/tantivy), a Rust version of Lucene, if building search indices via Java isn't your cup of tea :)
Both libraries offer multilingual support for keywords, I believe, so that's a benefit to vector search where multilingual embedding models are rather expensive.
-
Show HN: Quickwit – OSS Alternative to Elasticsearch, Splunk, Datadog
We also implemented our schemaless columnar storage optimized for object storage.
The inverted index and columnar storage are part of tantivy [0], which is the fastest search library out there. We maintain it and we decided to build the distributed engine on top of it.
[0] tantivy github repo: https://github.com/quickwit-oss/tantivy
-
Pg_bm25: Elastic-Quality Full Text Search Inside Postgres
The issue for geo search is here: https://github.com/quickwit-oss/tantivy/issues/44
-
Grimoire - A recipe management application.
Search index : Custom-built using tantivy.
-
A Compressed Indexable Bitset
The roaring bitmap variant is used only for the optional index (1 docid => 0 or 1 value) in the columnar storage (DocValues), not for the inverted index. Since this is used for aggregation, some queries may be a full scan.
The inverted index in tantivy uses bitpacked values of 128 elements with a skip index on top.
> I didn't follow the rest of your comment, select is what EF is good at, every other data structure needs a lot more scanning once you land on the right chunk. With BMI2 you can also use the PDEP instruction to accelerate the final select on a 64-bit block
The select for the sparse codec is a [simple array index access](https://github.com/quickwit-oss/tantivy/blob/main/columnar/s...), that is hard to beat. Compression is not good near the 5k threshold though.
-
Job: Rust + Retrieval Systems at Etsy
Hi /r/rust, I’m a SWE on Etsy’s Retrieval Systems team where we’re building a platform based on rust and tantivy (https://github.com/quickwit-oss/tantivy). We’re looking to bring two new engineers onto the team.
-
Announcing Velo - Your Rust-Powered Brainstorming and Note-Taking Tool
Quick Search: Easily find specific notes with Velo's fuzzy-search feature, powered by tantivy. tantivy might have been a little overkill, but it was really easy to integrate.
ht
-
Curl is now a CVE Numbering Authority
No need to use curl, make HTTP requests great again with https://github.com/ducaale/xh
-
Why people in Google hate Go?
Except when you actually enjoy things being fast. For example, HTTPie easily adds 0.5-1s delay to every request because it's written in Python, especially on the first invocation. xh (https://github.com/ducaale/xh), on the other hand, starts immediately because it's written in Rust. I very much like this trend.
-
HTTPie Desktop: cross-platform API testing client for humans
HTTPie is great and was a big improvement for me over cURL.
However, I ended up switching to xh[1] as it's significantly faster and I prefer its output.
https://github.com/ducaale/xh
-
Tell HN: Postman just wiped all my stuff
No, but unless portability is a concern or you're massively familiar with curl, you might want to consider xh. It's much more intuitive.
https://github.com/ducaale/xh
-
🐚🦀Comandos shell reescritos em Rust
xh
- Insomnia REST client now requires an account
-
The curl-wget Venn diagram
wget on the other hand, automatically converts the ñ to UTF-8 hex and resolves the link perfectly.
I've searched the curl manpage and couldn't find a way to solve this. Please help.
I'm having to use `xh --curl` [1] to "fix" the links before I pass them to curl.
[1] https://github.com/ducaale/xh
-
Get better with Vim one tip at a time
Very nice, you should add xh to the User-Agents though.
-
I Could Rewrite Curl
While not a rewrite - one recent tool for making http requests which i quite enjoy is:
https://github.com/ducaale/xh
It's basically python httpie rewritten in rust. my only gripe is that i keep forgetting that it exists - and that "xh" is for http and "xhs" is for https.
So i frequently end up with curl anyway:)
- xh: Friendly and fast tool for sending HTTP requests (HTTPie in Rust)
What are some alternatives?
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
rustdesk - An open-source remote desktop, and alternative to TeamViewer.
surrealdb - A scalable, distributed, collaborative, document-graph database, for the realtime web
htmlq - Like jq, but for HTML.
milli - Search engine library for Meilisearch ⚡️
httpie - 🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more.
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
gitoxide - An idiomatic, lean, fast & safe pure Rust implementation of Git
quickwit - Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
tty-share - Share your linux or osx terminal over the Internet.
fselect - Find files with SQL-like queries
thgtoa - The Hitchhiker’s Guide to Online Anonymity