searcharray

Full text search in your Pandas dataframe (by softwaredoug)

Searcharray Alternatives

Similar projects and alternatives to searcharray

  1. Pandas

    424 searcharray VS Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  2. Judoscale

    Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.

    Judoscale logo
  3. txtai

    385 searcharray VS txtai

    💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

  4. marqo

    117 searcharray VS marqo

    Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

  5. pgvector

    Open-source vector similarity search for Postgres

  6. auth

    49 searcharray VS auth

    Discontinued Fully open source, End to End Encrypted alternative to Google Photos and Apple Photos [Moved to: https://github.com/ente-io/ente]

  7. Internet-Places-Database

    Database of Internet places. Mostly domains

  8. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  9. ibis

    32 searcharray VS ibis

    the portable Python dataframe library

  10. lantern

    PostgreSQL vector database extension for building AI applications

  11. narwhals

    Lightweight and extensible compatibility layer between dataframe libraries!

  12. searx

    3 searcharray VS searx

    Discontinued Privacy-respecting metasearch engine [Moved to: https://github.com/searx/searx] (by asciimoo)

  13. ibish

    A Unix backend for Ibis

  14. PaddleNLP

    2 searcharray VS PaddleNLP

    Easy-to-use and powerful LLM and SLM library with awesome model zoo.

  15. sliders

    3 searcharray VS sliders

    Concept Sliders for Precise Control of Diffusion Models

  16. www.mechaelephant.com

    website for www.mechaelephant.com

  17. lofi-dx

    2 searcharray VS lofi-dx

    Discontinued A small, fast, local-first, searchable index for client side apps written in Typescript. Supports required, negated, and phrase queries.

  18. InfluxDB

    InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better searcharray alternative or higher similarity.

searcharray discussion

Log in or Post with

searcharray reviews and mentions

Posts with mentions or reviews of searcharray. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-08-29.
  • Show HN: SearchArray – Full text search that feels like a NumPy array
    1 project | news.ycombinator.com | 15 Apr 2025
  • Farewell Pandas, and thanks for all the fish
    5 projects | news.ycombinator.com | 29 Aug 2024
    GeoPandas is one popular library (https://geopandas.org/en/stable/)

    I added a column type for full text search, searching tokenized text (https://github.com/softwaredoug/searcharray)

    The pandas support for extensions is very robust with extensive testing to make sure you can do all the data aggregations functionality you expect in and pandas column.

  • A search engine in 80 lines of Python
    6 projects | news.ycombinator.com | 7 Feb 2024
    This is really cool. I have a pretty fast BM25 search engine in Pandas I've been working on for local testing.

    https://github.com/softwaredoug/searcharray

    Why Pandas? Because BM25 is one thing, but you also want to combine with other factors (recency, popularity, etc) easily computed in pandas / numpy...

  • Are we at peak vector database?
    8 projects | news.ycombinator.com | 25 Jan 2024
    You might be interested in

    https://github.com/softwaredoug/searcharray

  • SearchArray turns Pandas string columns into a term index
    1 project | news.ycombinator.com | 27 Dec 2023
  • Show HN: SearchArray – Text Search in Pandas
    1 project | news.ycombinator.com | 19 Nov 2023
    I've long worked with Lucene based search engines like Solr and Elasticsearch. Anytime I need to experiment with relevance ranking in these systems, I'm exhausted by needing to set them up and work with something so disjoint from normal data tooling.

    Further - the underlying ranking is buried in needless mystique (you know a boolean should query, sums the scores, right?). You shouldn't need to read a book (like Relevant Search ;) ) to unpack mystique that's really basic math.

    Why not just let people build ranking systems with vectorized math in a numpy/pandas stack?

    SearchArray lets anyone build a search prototype in Pandas. Typically building / experimenting with a smaller labeled dataset. If it works out, you can transfer it relatively easily to Elasticsearch or Solr for implementation.

    SearchArray is a pandas extension array that creates an underlying search index for BM25 term/phrase based searching.

    It's not quite done (will it ever be?) but its getting far enough along to be useful. So feedback is very welcome.

    https://github.com/softwaredoug/searcharray

  • A note from our sponsor - InfluxDB
    influxdata.com | 19 Apr 2025
    Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems. Learn more →

Stats

Basic searcharray repo stats
6
223
9.4
4 months ago

Sponsored
Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?