whoosh VS ClickHouse

Compare whoosh vs ClickHouse and see what are their differences.

whoosh

Pure-Python full-text search library (by mchaput)
Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
whoosh ClickHouse
5 207
520 33,712
- 2.4%
0.0 10.0
3 months ago about 4 hours ago
Python C++
GNU General Public License v3.0 or later Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

whoosh

Posts with mentions or reviews of whoosh. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-13.
  • Milli-py: Python bindings for Milli, an embeddable high-performance search engine
    4 projects | /r/Python | 13 Feb 2023
    The only other embeddable search engine I'm aware off, Whoosh, is brilliant but building the index was quite slow, and search performance degraded quite a lot as number of documents increase (performance is strictly a non-goal). Meilisearch was comparatively faster, I didn't like managing a server to get "just search" in my scripts and applications. However, their underlying engine Milli solves both issues I had, and all that was needed creating bindings for it.
  • Meilisearch v1.0 – the open-source Rust alternative to Algolia and Elasticsearch
    8 projects | news.ycombinator.com | 8 Feb 2023
    Is it really "just a single statically linked binary"?

    I'd love to use Meilisearch as you describe, but their so-called SDKs are just about for the search client, you still need the HTTP server listening on localhost.

    I would love to see something like SQLite based off Meilisearch (i.e. a fully selfcontained library like https://github.com/mchaput/whoosh). Do you know if such a thing exists?

  • Starting a KF Discord Bot
    3 projects | /r/KnowledgeFight | 24 Oct 2022
    Your best bet is to start using a proper search library rather than the simple loop with 'in' checks that you have now. A search lib will handle things like Unicode/ASCII similarities, removal of stop words, stemming, TF-IDF (and other) weighting, etc. and will be massively faster as well. Quite a few pages come up if you Google "python search engine", also Whoosh looks promising.

ClickHouse

Posts with mentions or reviews of ClickHouse. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-24.

What are some alternatives?

When comparing whoosh and ClickHouse you can also consider the following projects:

loki - Like Prometheus, but for logs.

duckdb - DuckDB is an in-process SQL OLAP Database Management System

Trino - Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

VictoriaMetrics - VictoriaMetrics: fast, cost-effective monitoring solution and time series database

TimescaleDB - An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.

arrow-datafusion - Apache Arrow DataFusion SQL Query Engine

RocksDB - A library that provides an embeddable, persistent key-value store for fast storage.

materialize - The data warehouse for operational workloads.

PostgreSQL - Mirror of the official PostgreSQL GIT repository. Note that this is just a *mirror* - we don't work with pull requests on github. To contribute, please see https://wiki.postgresql.org/wiki/Submitting_a_Patch

TileDB - The Universal Storage Engine

Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

Adminer - Database management in a single PHP file