Top 23 search-engine Open-Source Projects
-
MeiliSearch
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
-
sonic
🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
-
Typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
-
qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
-
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
-
Weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
-
manticoresearch
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
-
orama
🌌 Fast, dependency-free, full-text and vector search engine with typo tolerance, filters, facets, stemming, and more. Works with any JavaScript runtime, browser, server, service!
-
txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
-
awesome-hacker-search-engines
A curated list of awesome search engines useful during Penetration testing, Vulnerability assessments, Red/Blue Team operations, Bug Bounty and more
-
quickwit
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
-
RediSearch
A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.
-
LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
You could check out their GitHub and see what is going on https://github.com/elastic/elasticsearch/issues
We needed to introduce a new service for search. As we settled on using meilisearch, we needed a way to sync updates on our models with the records in meilisearch. We could've continued to use callbacks but we needed something better.
Sonic is a project written in Rust that uses a custom network protocol for fast communication between the client and the server.
There are actually plenty of non-ES products that are way easier to integrate and tune (and get better results with less effort).
- Typesense (https://github.com/typesense/typesense)
- Algolia
- Google Programmable Search Engine (https://programmablesearchengine.google.com/about/)
Project mention: Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant | dev.to | 2024-04-25
I chose Qdrant for this project because it is built for high-performance vector search, which suits use cases such as finding similar function calls by semantic similarity. Qdrant is also scalable and supports a variety of advanced search features that are useful for nuanced caching mechanisms like ours.
Project mention: SeekStorm VS tantivy - a user suggested alternative | libhunt.com/r/SeekStorm | 2024-03-22
Project mention: pgvecto.rs alternatives - qdrant and Weaviate | libhunt.com/r/pgvecto.rs | 2024-03-13
Project mention: [Self Hosted] Self-hosted mail servers: mailcow, mailinabox, mailU... have you tested them (in depth)? Your opinions and advice here, thanks! | /r/aufdeutsch | 2023-04-27
Note, you need to do it in the root folder of a clone from https://github.com/manticoresoftware/manticoresearch
Project mention: Sky's the Limit! Supercharging Your Astro Blog with Orama, the Ultimate Stargazing Search Engine! | dev.to | 2023-08-03
Let's break into the steps to utilize Orama and analyze how it works. I won't dig into the technical stuff because, hey, it's an open-source project, which means you can easily peek at the source code, no problemo!
txtai is an all-in-one embeddings database for semantic search, LLM orchestration and language model workflows.
Project mention: Show HN: I've built a locally running perplexity clone | news.ycombinator.com | 2024-04-03
We (Marqo) are doing a lot on 1 and 2. There is a huge amount to be done on the ML side of vector search and we are investing heavily in it. I think it has not quite sunk in that vector search systems are ML systems and everything that comes with that. I would love to chat about 1 and 2 so feel free to email me (email is in my profile). What we have done so far is here -> https://github.com/marqo-ai/marqo
Project mention: Tantivy 0.20 is released: Schemaless column store, Schemaless aggregations, Phrase prefix queries, Percentiles, and more... | /r/rust | 2023-06-20
I don't think you have an active project that addresses all those use cases. There was an attempt in Rust with Toshi that is built on top of tantivy, but the project seems to have stalled.
search-engine related posts
- Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant
- Show HN: I've built a locally running perplexity clone
- Myscaledb: Open-source SQL vector database to build AI apps using SQL
- Show HN: High-Performance SQL Vector Database MyScaleDB Goes Open Source
- SeekStorm VS tantivy - a user suggested alternative (2 projects | 22 Mar 2024)
- pgvecto.rs alternatives - qdrant and Weaviate (3 projects | 13 Mar 2024)
- Open-source Rust-based RAG
Index
What are some of the best open-source search-engine projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | Elasticsearch | 67,531 |
2 | MeiliSearch | 43,284 |
3 | sonic | 19,419 |
4 | Typesense | 17,876 |
5 | qdrant | 17,839 |
6 | flexsearch | 11,839 |
7 | PaddleNLP | 11,386 |
8 | tantivy | 9,839 |
9 | Weaviate | 9,436 |
10 | whoogle-search | 8,789 |
11 | Mailpile | 8,778 |
12 | OpenSearch | 8,685 |
13 | manticoresearch | 8,289 |
14 | orama | 8,018 |
15 | txtai | 6,953 |
16 | awesome-hacker-search-engines | 6,664 |
17 | quickwit | 6,052 |
18 | vespa | 5,336 |
19 | RediSearch | 5,189 |
20 | LLocalSearch | 4,810 |
21 | Apache Solr | 4,365 |
22 | marqo | 4,111 |
23 | Toshi | 4,110 |