Tantivy – full-text search engine library inspired by Apache Lucene

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers
Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.
www.nutrient.io
featured
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured
  1. tantivy

    Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust

    | Hm, I am interested, but I would love to use it as a rust lib and just have rust types instead of some json config...

    Yes that's how you use tantivy normally, not sure which json config you mean.

    `tantivy-cli` is more like a showcase, https://github.com/quickwit-oss/tantivy is the actual project.

  2. Nutrient

    Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers. Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.

    Nutrient logo
  3. paradedb

    Postgres for Search and Analytics

    Postgres's FTS is actually quite solid! You can get very far with just the built-in tsvector. The ranking could be improved, though, which was one of the reasons for creating pg_search in the first place: https://github.com/paradedb/paradedb/tree/dev/pg_search (disclaimer: I work on pg_search @ ParadeDB)

  4. tantivy-cli

    but the setup looks pretty simple [0].

    [0]: https://github.com/quickwit-oss/tantivy-cli

  5. progscrape

    progscrape.com source

    I built progscrape.com [1] on top of it -- it's a great library and insanely fast.

    I keep thinking I should write up my experiences with it. It was pretty much a drop-in, super-useful library and the team was very responsive with bug reports, of which there were very few.

    I'd recommend looking at it over Lucene for modern projects. I am a big fan.

    [1] https://github.com/progscrape/progscrape

  6. quickwit

    Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.

    https://github.com/quickwit-oss/quickwit

    Had a surprisingly good experience with combined power of Quickwit and Clickhouse for multilingual search pet project. Finally something usable for Chinese, Japanese, Korean

    https://quickwit.io/docs/guides/add-full-text-search-to-your...

    to_tsvector in PG never worked well for my use cases

    SELECT * FROM dump WHERE to_tsvector('english'::regconfig, hh_fullname) @@ to_tsquery('english'::regconfig, 'query');

    Wish them to succeed. Will automatically upvote any post Tantivy as keyword

  7. hound

    Lightning fast code searching made easy

    Another resource is a trigram search index (in Go) used by etsy/hound[0] based on an article (and code) from Russ Cox: Regular Expression Matching with a Trigram Index[1].

    [0] https://github.com/hound-search/hound

    [1] http://swtch.com/~rsc/regexp/regexp4.html

    Different use-cases for alternatives to Lucene depending on your needs.

  8. stork

    🔎 Impossibly fast web search, made for static sites.

    I ‘m using https://stork-search.net for my static website search, but it’s no longer maintained. So yeah, Tantivy would be a great candidate to replace it! :)

  9. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Adding search to static websites

    1 project | dev.to | 4 May 2024
  • WebAssembly Playground

    9 projects | news.ycombinator.com | 4 Feb 2024
  • A minimal working Rust / SDL2 / WASM browser game

    6 projects | news.ycombinator.com | 15 Jan 2024
  • Package All the Things

    4 projects | dev.to | 13 Oct 2023
  • Porting a local app to Web

    4 projects | /r/rust | 12 May 2023

Did you know that Rust is
the 5th most popular programming language
based on number of references?