whoosh VS Elasticsearch

Compare whoosh vs Elasticsearch and see what are their differences.

whoosh

Pure-Python full-text search library (by mchaput)

Elasticsearch

Free and Open, Distributed, RESTful Search Engine (by elastic)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
whoosh Elasticsearch
5 91
530 67,632
- 0.6%
0.0 10.0
4 months ago 4 days ago
Python Java
GNU General Public License v3.0 or later GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

whoosh

Posts with mentions or reviews of whoosh. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-13.
  • Milli-py: Python bindings for Milli, an embeddable high-performance search engine
    4 projects | /r/Python | 13 Feb 2023
    The only other embeddable search engine I'm aware off, Whoosh, is brilliant but building the index was quite slow, and search performance degraded quite a lot as number of documents increase (performance is strictly a non-goal). Meilisearch was comparatively faster, I didn't like managing a server to get "just search" in my scripts and applications. However, their underlying engine Milli solves both issues I had, and all that was needed creating bindings for it.
  • Meilisearch v1.0 – the open-source Rust alternative to Algolia and Elasticsearch
    8 projects | news.ycombinator.com | 8 Feb 2023
    Is it really "just a single statically linked binary"?

    I'd love to use Meilisearch as you describe, but their so-called SDKs are just about for the search client, you still need the HTTP server listening on localhost.

    I would love to see something like SQLite based off Meilisearch (i.e. a fully selfcontained library like https://github.com/mchaput/whoosh). Do you know if such a thing exists?

  • Faster Full Text Search
    1 project | /r/django | 5 Jan 2023
    For our full text search, we used whoosh, which works pretty well for moderately big amount of data.
  • We upgraded an old, 3PB large, Elasticsearch cluster without downtime
    1 project | news.ycombinator.com | 11 Nov 2022
    Nearly a decade ago (oh god) I converted some overdesigned five node ES mess to https://github.com/mchaput/whoosh. It's (obviously) not the fastest or anything, but it was more than good enough for low-dozens of GBs of mostly static data.
  • Starting a KF Discord Bot
    3 projects | /r/KnowledgeFight | 24 Oct 2022
    Your best bet is to start using a proper search library rather than the simple loop with 'in' checks that you have now. A search lib will handle things like Unicode/ASCII similarities, removal of stop words, stemming, TF-IDF (and other) weighting, etc. and will be massively faster as well. Quite a few pages come up if you Google "python search engine", also Whoosh looks promising.

Elasticsearch

Posts with mentions or reviews of Elasticsearch. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-10-18.

What are some alternatives?

When comparing whoosh and Elasticsearch you can also consider the following projects:

Search Engine Parser - Lightweight package to query popular search engines and scrape for result titles, links and descriptions

OpenSearch - 🔎 Open source distributed and RESTful search engine.

pysolr - Pysolr — Python Solr client

Apache Superset - Apache Superset is a Data Visualization and Data Exploration Platform [Moved to: https://github.com/apache/superset]

elasticsearch-dsl-py - High level Python client for Elasticsearch

bleve - A modern text/numeric/geo-spatial/vector indexing library for go

query-builder - sql query builder library for crystal-lang

pgvector - Open-source vector similarity search for Postgres

query.cr - Query abstraction for Crystal Language. Used by active_record.cr library.

Whoosh

lunr.js - A bit like Solr, but much smaller and not as bright

MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow