Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems. Learn more →
Searcharray Alternatives
Similar projects and alternatives to searcharray
-
Pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
-
Judoscale
Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
-
txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
-
-
-
auth
Discontinued Fully open source, End to End Encrypted alternative to Google Photos and Apple Photos [Moved to: https://github.com/ente-io/ente]
-
-
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
-
-
-
-
searx
Discontinued Privacy-respecting metasearch engine [Moved to: https://github.com/searx/searx] (by asciimoo)
-
-
-
-
-
lofi-dx
Discontinued A small, fast, local-first, searchable index for client side apps written in Typescript. Supports required, negated, and phrase queries.
-
-
InfluxDB
InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
searcharray discussion
searcharray reviews and mentions
- Show HN: SearchArray – Full text search that feels like a NumPy array
-
Farewell Pandas, and thanks for all the fish
GeoPandas is one popular library (https://geopandas.org/en/stable/)
I added a column type for full text search, searching tokenized text (https://github.com/softwaredoug/searcharray)
The pandas support for extensions is very robust with extensive testing to make sure you can do all the data aggregations functionality you expect in and pandas column.
-
A search engine in 80 lines of Python
This is really cool. I have a pretty fast BM25 search engine in Pandas I've been working on for local testing.
https://github.com/softwaredoug/searcharray
Why Pandas? Because BM25 is one thing, but you also want to combine with other factors (recency, popularity, etc) easily computed in pandas / numpy...
-
Are we at peak vector database?
You might be interested in
https://github.com/softwaredoug/searcharray
- SearchArray turns Pandas string columns into a term index
-
Show HN: SearchArray – Text Search in Pandas
I've long worked with Lucene based search engines like Solr and Elasticsearch. Anytime I need to experiment with relevance ranking in these systems, I'm exhausted by needing to set them up and work with something so disjoint from normal data tooling.
Further - the underlying ranking is buried in needless mystique (you know a boolean should query, sums the scores, right?). You shouldn't need to read a book (like Relevant Search ;) ) to unpack mystique that's really basic math.
Why not just let people build ranking systems with vectorized math in a numpy/pandas stack?
SearchArray lets anyone build a search prototype in Pandas. Typically building / experimenting with a smaller labeled dataset. If it works out, you can transfer it relatively easily to Elasticsearch or Solr for implementation.
SearchArray is a pandas extension array that creates an underlying search index for BM25 term/phrase based searching.
It's not quite done (will it ever be?) but its getting far enough along to be useful. So feedback is very welcome.
https://github.com/softwaredoug/searcharray
-
A note from our sponsor - InfluxDB
influxdata.com | 19 Apr 2025
Stats
softwaredoug/searcharray is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of searcharray is Python.