Top 12 Search Engine Open-Source Projects
-
the-book-of-secret-knowledge
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
Project mention: Collection of manuals, cheatsheets, blogs, one-liners, CLI/web tools | news.ycombinator.com | 2021-02-19
-
MeiliSearch
Lightning Fast, Ultra Relevant, and Typo-Tolerant Search Engine
Project mention: ClickHouse as an alternative to Elasticsearch for log storage and analysis | news.ycombinator.com | 2021-03-02
-
Searx
Privacy-respecting metasearch engine
It’s not that big of a deal, since all credit goes to searx, a privacy-respecting FOSS metasearch engine.
-
Typesense
Fast, typo tolerant, fuzzy search engine for building delightful search experiences ⚡ 🔍
Project mention: Ask HN: What tangible benefits did you get from spending time on HN? | news.ycombinator.com | 2021-03-06
I've been following HN for 10+ years, first as a lurker and then getting into the whole "build something people want" thing. Over the years, I've "launched" quite a few projects here. Some have failed, while others have succeeded far beyond my modest expectations. But in the pre-Product Hunt era, launching on HN was the only way to get exposure for your product. Even today, for a number of highly technical projects, HN is the best place to get the word out.
While the HN crowd has a reputation for being too cynical at times (the most famous example being the original "Show HN Dropbox"), over time, pre-empting how the HN crowd will potentially react and what kind of criticism a project might attract has actually helped me improve the product before launch!
> I mean one day you got traffic 100K on the website. Good. But just for one day.
My latest project, Typesense, which is an open source instant search engine (https://github.com/typesense/typesense) literally found traction only after posting here on HN. Yes, it was a ~50K single day traffic, but it had a permanent impact on the baseline traffic. So nothing is as useless as it looks :)
Apart from the value I've gotten out of all these Show HNs, there is an incredible amount of value in the comments on HN. In fact, I often skip the main post and just skim through the comments. Also, unlike certain other forums, snarky/toxic comments are discouraged and moderated.
-
Yacy
Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance
Project mention: Brave search engine: no tracking, profiling – may offer paid-for, no-ad version | news.ycombinator.com | 2021-03-03
-
Ambar
:mag: Ambar: Document Search Engine
There is also stuff like Mayan EDMS, which is much more enterprise-oriented, or Ambar, which is targeted more at individual users.
-
Gigablast
Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.
-
Weaviate
Weaviate is a cloud-native, modular, real-time vector search engine
Project mention: V1 of the open-source Vector Search Engine Weaviate released | reddit.com/r/Database | 2021-01-19
Developer documentation: https://www.semi.technology/developers/weaviate/current/
GitHub: https://github.com/semi-technologies/weaviate
-
sist2
Lightning-fast file system indexer and search tool
-
Seeks
Seeks is a decentralized p2p websearch and collaborative tool.
There are two that I know of:
YaCy: https://github.com/yacy/yacy_search_server (functional)
Seeks: https://github.com/beniz/seeks (defunct)
---
There's also SearX, which isn't distributed but is a metasearch engine (pulls results from multiple search engines) that you can self-host.
-
dmt
dmt engine (pc, server or small computers)
Project mention: Are there are good tools to manage/search collections of documents, saved web pages etc? | reddit.com/r/DataHoarder | 2021-01-16
Small update, went rereading this part you claimed is a bunch of nonsense: https://github.com/uniqpath/dmt/blob/main/help/ZETA_BACKGROUND.md
-
multiSearchHome
:mag_right: Local standalone HTML homepage to search in 175 search engines (DuckDuckGo, YouTube, Twitter, Wikipedia, etc.)
Index
What are some of the best open-source Search Engine projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | the-book-of-secret-knowledge | 37,290 |
2 | MeiliSearch | 12,297 |
3 | Searx | 8,391 |
4 | Typesense | 5,082 |
5 | Yacy | 2,127 |
6 | Ambar | 1,686 |
7 | Gigablast | 1,205 |
8 | Weaviate | 496 |
9 | sist2 | 242 |
10 | Seeks | 219 |
11 | dmt | 22 |
12 | multiSearchHome | 2 |