roaring-rs VS tantivy

Compare roaring-rs vs tantivy and see what are their differences.

tantivy

Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust (by quickwit-oss)
Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
roaring-rs tantivy
12 48
682 9,896
1.5% 3.8%
7.2 9.1
14 days ago 3 days ago
Rust Rust
Apache License 2.0 MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

roaring-rs

Posts with mentions or reviews of roaring-rs. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-08.

tantivy

Posts with mentions or reviews of tantivy. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-22.
  • SeekStorm VS tantivy - a user suggested alternative
    2 projects | 22 Mar 2024
  • What is Hybrid Search?
    6 projects | dev.to | 6 Feb 2024
    Tantivy - a full-text indexing library written in Rust. Has a great performance and featureset.
  • Tantivy – Fast, OSS full-text search library in Rust
    1 project | news.ycombinator.com | 20 Jan 2024
  • RAG Using Unstructured Data and Role of Knowledge Graphs
    4 projects | news.ycombinator.com | 17 Jan 2024
    By this I presume you mean build a search index that can retrieve results based on keywords? I know certain databases use Lucene to build a keyword-based index on top of unstructured blobs of data. Another alternative is to use Tantivy (https://github.com/quickwit-oss/tantivy), a Rust version of Lucene, if building search indices via Java isn't your cup of tea :)

    Both libraries offer multilingual support for keywords, I believe, so that's a benefit to vector search where multilingual embedding models are rather expensive.

  • Show HN: Quickwit – OSS Alternative to Elasticsearch, Splunk, Datadog
    4 projects | news.ycombinator.com | 7 Jan 2024
    We also implemented our schemaless columnar storage optimized for object storage.

    The inverted index and columnar storage are part of tantivy [0], which is the fastest search library out there. We maintain it and we decided to build the distributed engine on top of it.

    [0] tantivy github repo: https://github.com/quickwit-oss/tantivy

  • Pg_bm25: Elastic-Quality Full Text Search Inside Postgres
    6 projects | news.ycombinator.com | 8 Oct 2023
    The issue for geo search is here: https://github.com/quickwit-oss/tantivy/issues/44
  • Grimoire - A recipe management application.
    7 projects | /r/rust | 5 Oct 2023
    Search index : Custom-built using tantivy.
  • A Compressed Indexable Bitset
    6 projects | news.ycombinator.com | 1 Jul 2023
    The roaring bitmap variant is used only for the optional index (1 docid => 0 or 1 value) in the columnar storage (DocValues), not for the inverted index. Since this is used for aggregation, some queries may be a full scan.

    The inverted index in tantivy uses bitpacked values of 128 elements with a skip index on top.

    > I didn't follow the rest of your comment, select is what EF is good at, every other data structure needs a lot more scanning once you land on the right chunk. With BMI2 you can also use the PDEP instruction to accelerate the final select on a 64-bit block

    The select for the sparse codec is a [simple array index access](https://github.com/quickwit-oss/tantivy/blob/main/columnar/s...), that is hard to beat. Compression is not good near the 5k threshold though.

  • Job: Rust + Retrieval Systems at Etsy
    2 projects | /r/rust | 23 Jun 2023
    Hi /r/rust, I’m a SWE on Etsy’s Retrieval Systems team where we’re building a platform based on rust and tantivy (https://github.com/quickwit-oss/tantivy). We’re looking to bring two new engineers onto the team.
  • Announcing Velo - Your Rust-Powered Brainstorming and Note-Taking Tool
    4 projects | /r/rust | 19 Jun 2023
    Quick Search: Easily find specific notes with Velo's fuzzy-search feature, powered by tantivy. tantivy might have been a little overkill, but it was really easy to integrate.

What are some alternatives?

When comparing roaring-rs and tantivy you can also consider the following projects:

generic-array - Generic array types in Rust

sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.

array_tool - Array helpers for Rust's Vector and String types

surrealdb - A scalable, distributed, collaborative, document-graph database, for the realtime web

croaring-rs - Rust FFI wrapper for CRoaring

milli - Search engine library for Meilisearch ⚡️

nym - Manipulate files en masse using patterns.

MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

quickwit - Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.

base_custom - Rust implementation of custom numeric base conversion.

fselect - Find files with SQL-like queries