Top 23 search-engine Open-Source Projects
-
Elasticsearch
Free and Open, Distributed, RESTful Search Engine
Project mention: AWS announces forks of Elasticsearch and Kibana | news.ycombinator.com | 2021-01-21
Thank you! I had actually tried to find it after posting my comment and couldn't find it through their website. The day of the announcement GitHub hadn't been updated yet, and I gave up when I saw this didn't say where to look: https://github.com/elastic/elasticsearch/blob/master/LICENSE....
-
learn-anything
Organize world's knowledge, explore connections and curate learning paths
Project mention: Show HN: Vellum – An interactive list of nonfiction books reviewed by academics | news.ycombinator.com | 2021-02-20
Your site looks awesome indeed! It reminds me of https://learn-anything.xyz but with a stronger focus on community.
Here you go! Only the titles and review URLs were taken from the original sites. The rest are from Google Books.
Science and Nature: https://gist.github.com/PizzaMyHeart/fd264434b2b92ba6dac0871...
LSE: https://gist.github.com/PizzaMyHeart/c07e162df141ced84509a3f...
The book tags were taken from the source HTML and seem to have been done very sloppily, especially the Nature ones, so you'd want to create your own tags anyway.
-
MeiliSearch
Lightning Fast, Ultra Relevant, and Typo-Tolerant Search Engine
Project mention: ClickHouse as an alternative to Elasticsearch for log storage and analysis | news.ycombinator.com | 2021-03-02
https://github.com/meilisearch/MeiliSearch has been getting a lot of traction recently. There is also Sphinx and its fork https://manticoresearch.com/, which is very lightweight and fast.
-
sonic
🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
Project mention: ClickHouse as an alternative to Elasticsearch for log storage and analysis | news.ycombinator.com | 2021-03-02
I'm personally very fond of sonic [0] for full text search.
> Sonic can be used as a simple alternative to super-heavy and full-featured search backends such as Elasticsearch in some use-cases. It is capable of normalizing natural language search queries, auto-completing a search query and providing the most relevant results for a query....
> When reviewing Elasticsearch (ELS) and others, we found those were full-featured heavyweight systems that did not scale well with Crisp's freemium-based cost structure.
> At the end, we decided to build our own search backend, designed to be simple and lightweight on resources
-
Mailpile
A free & open modern, fast email client with user-friendly encryption and privacy features
Project mention: We need better open source e-mail clients! | reddit.com/r/privacytoolsIO | 2021-02-02
Mailpile.is comes to mind. Their Twitter account has been inactive since 2018, but they did fix some things on GitHub in November 2020: https://github.com/mailpile/Mailpile/
-
flexsearch
Next-Generation full text search library for Browser and Node.js
In case you want to search your GraphQL layer using a search engine like FlexSearch, you might want to have a look at this plugin!
-
riot
Go Open Source, Distributed, Simple and efficient Search Engine (by go-ego)
-
Typesense
Fast, typo tolerant, fuzzy search engine for building delightful search experiences ⚡ 🔍
Project mention: ClickHouse as an alternative to Elasticsearch for log storage and analysis | news.ycombinator.com | 2021-03-02
-
tantivy
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
I created one for tantivy after reading your blog post. https://github.com/tantivy-search/tantivy/blob/main/ARCHITECTURE.md
-
Apache Solr
Apache Lucene and Solr open-source search software
Project mention: ClickHouse as an alternative to Elasticsearch for log storage and analysis | news.ycombinator.com | 2021-03-02
I found Apache Lucene really easy to use, but haven't experienced it at scale:
-
Toshi
A full-text search engine in rust
Luckily there are faster and smaller alternatives to Elasticsearch written in Rust: Toshi[1], Meili[2] and Sonic[3]. In the age of Rust there is no need to accept the JVM's overhead.
[1] https://github.com/toshi-search/Toshi
-
magnetissimo
Web application that indexes all popular torrent sites, and saves it to the local database.
-
Magnetico
Autonomous (self-hosted) BitTorrent DHT search engine suite.
I'm using magnetico (https://github.com/boramalper/magnetico), a selfhosted bittorrent DHT scraper. After a few months of running I now have a little bit under 1M magnets in a ~3G database. SQLite is useful not just because I can query it with the full power of a relational database, but because I'm using its FTS capability for searching.
If I had to install a typical database or some search engine I would never have used it. It is more than enough for what I'm using it for.
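The SQLite full-text search approach described in this comment can be sketched roughly as follows. This is a minimal illustration using Python's built-in sqlite3 module, not magnetico's actual schema or code: the table name `torrents`, the column `name`, and the helper functions are all hypothetical, and it assumes the local SQLite build ships the FTS5 extension (falling back to FTS4 if it does not).

```python
import sqlite3


def build_index(names):
    """Create an in-memory SQLite database with a full-text index over names.

    The table and column names here are hypothetical, chosen for illustration.
    """
    conn = sqlite3.connect(":memory:")
    try:
        # FTS5 is compiled into most modern SQLite builds (assumption).
        conn.execute("CREATE VIRTUAL TABLE torrents USING fts5(name)")
    except sqlite3.OperationalError:
        # Fall back to the older FTS4 module if FTS5 is unavailable.
        conn.execute("CREATE VIRTUAL TABLE torrents USING fts4(name)")
    conn.executemany(
        "INSERT INTO torrents(name) VALUES (?)", [(n,) for n in names]
    )
    conn.commit()
    return conn


def search(conn, query):
    """Return the names matching a full-text query, in insertion order."""
    rows = conn.execute(
        "SELECT name FROM torrents WHERE torrents MATCH ? ORDER BY rowid",
        (query,),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = build_index(
        ["Ubuntu 20.04 desktop ISO", "Debian 10 netinst ISO", "Big Buck Bunny 1080p"]
    )
    for name in search(conn, "iso"):
        print(name)
```

Because the index lives in an ordinary SQLite file (or in memory, as here), no separate database server or search daemon has to be installed, which is the convenience the comment points to.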
-
TNTSearch
A fully featured full text search engine written in PHP
-
Ambar
🔍 Ambar: Document Search Engine
There is also stuff like Mayan EDMS, which is much more enterprise-oriented, or Ambar, which is targeted more at individual users.
-
tinysearch
🔍 Tiny, full-text search engine for static websites built with Rust and Wasm
-
whoogle-search
A self-hosted, ad-free, privacy-respecting metasearch engine
-
search-plugins
Search plugins for the search feature
Project mention: Thank you Google DMCA notice for highlighting links to the real contents! | reddit.com/r/Piracy | 2021-02-18
-
rats-search
BitTorrent P2P multi-platform search engine for Desktop and Web servers with integrated torrent client.
Project mention: Rats on The Boat - BitTorrent search engine v1.5.0 | reddit.com/r/Piracy | 2021-01-14
-
DuckieTV
A web application built with AngularJS to track your favorite tv-shows with semi-automagic torrent integration
Project mention: Show posters doesn't show or refresh on Calendar | reddit.com/r/duckietv | 2021-03-01
Work was started a while ago to see if DuckieTV could be switched to fetching posters from TMDB; follow https://github.com/SchizoDuckie/DuckieTV/issues/1140 for progress on that topic.
-
Infinispan
Infinispan is an open source data grid platform and highly scalable NoSQL cloud data store.
-
bertsearch
Elasticsearch with BERT for advanced document search.
Project mention: The Weakness of Full Text Searching (2008) [pdf] | news.ycombinator.com | 2020-12-24
There are a few things in this space that might be interesting:
- https://github.com/Hironsan/bertsearch
- https://github.com/hanxiao/bert-as-service
There was another I can't find right now that looked more polished/professional. But, in short, no, it's pretty easy to set up. You just need a machine with a pretty big disk and to be OK with some index latency. If you're using Elasticsearch then you're already there on both counts!
-
SearchCop
Search engine like fulltext query support for ActiveRecord
Index
What are some of the best open-source search-engine projects? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | Elasticsearch | 53,853 |
2 | learn-anything | 12,801 |
3 | MeiliSearch | 12,297 |
4 | sonic | 10,839 |
5 | Mailpile | 8,342 |
6 | flexsearch | 6,882 |
7 | riot | 5,638 |
8 | Typesense | 5,082 |
9 | tantivy | 4,528 |
10 | Apache Solr | 4,095 |
11 | Toshi | 3,068 |
12 | magnetissimo | 2,492 |
13 | Magnetico | 2,441 |
14 | TNTSearch | 2,308 |
15 | Ambar | 1,686 |
16 | tinysearch | 1,475 |
17 | whoogle-search | 1,467 |
18 | search-plugins | 1,411 |
19 | rats-search | 1,031 |
20 | DuckieTV | 940 |
21 | Infinispan | 858 |
22 | bertsearch | 680 |
23 | SearchCop | 659 |