Pure-Python full-text search library (by mchaput)

Whoosh Alternatives

Similar projects and alternatives to whoosh

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better whoosh alternative or higher similarity.

whoosh reviews and mentions

Posts with mentions or reviews of whoosh. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-13.
  • Milli-py: Python bindings for Milli, an embeddable high-performance search engine
    4 projects | /r/Python | 13 Feb 2023
    The only other embeddable search engine I'm aware off, Whoosh, is brilliant but building the index was quite slow, and search performance degraded quite a lot as number of documents increase (performance is strictly a non-goal). Meilisearch was comparatively faster, I didn't like managing a server to get "just search" in my scripts and applications. However, their underlying engine Milli solves both issues I had, and all that was needed creating bindings for it.
  • Meilisearch v1.0 – the open-source Rust alternative to Algolia and Elasticsearch
    8 projects | news.ycombinator.com | 8 Feb 2023
    Is it really "just a single statically linked binary"?

    I'd love to use Meilisearch as you describe, but their so-called SDKs are just about for the search client, you still need the HTTP server listening on localhost.

    I would love to see something like SQLite based off Meilisearch (i.e. a fully selfcontained library like https://github.com/mchaput/whoosh). Do you know if such a thing exists?

  • Faster Full Text Search
    1 project | /r/django | 5 Jan 2023
    For our full text search, we used whoosh, which works pretty well for moderately big amount of data.
  • We upgraded an old, 3PB large, Elasticsearch cluster without downtime
    1 project | news.ycombinator.com | 11 Nov 2022
    Nearly a decade ago (oh god) I converted some overdesigned five node ES mess to https://github.com/mchaput/whoosh. It's (obviously) not the fastest or anything, but it was more than good enough for low-dozens of GBs of mostly static data.
  • Starting a KF Discord Bot
    3 projects | /r/KnowledgeFight | 24 Oct 2022
    Your best bet is to start using a proper search library rather than the simple loop with 'in' checks that you have now. A search lib will handle things like Unicode/ASCII similarities, removal of stop words, stemming, TF-IDF (and other) weighting, etc. and will be massively faster as well. Quite a few pages come up if you Google "python search engine", also Whoosh looks promising.
  • A note from our sponsor - SaaSHub
    www.saashub.com | 18 May 2024
    SaaSHub helps you find the best software and product alternatives Learn more →


Basic whoosh repo stats
5 months ago

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives