Top 12 Search Engine Open-Source Projects
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
Project mention: Collection of manuals, cheatsheets,blogs,one-liners,CLI/web tools | news.ycombinator.com | 2021-02-19
Lightning Fast, Ultra Relevant, and Typo-Tolerant Search Engine
Project mention: ClickHouse as an alternative to Elasticsearch for log storage and analysis | news.ycombinator.com | 2021-03-02
Privacy-respecting metasearch engine
Project mention: Surfing web inside a terminal, because why not? | dev.to | 2021-03-06
It’s not that big of a deal, since all credit goes to searx, a privacy-respecting FOSS metasearch engine.
Fast, typo tolerant, fuzzy search engine for building delightful search experiences ⚡ 🔍
Project mention: Ask HN: What tangible benefits did you get from spending time on HN? | news.ycombinator.com | 2021-03-06
I've been following HN for 10+ years, first as a lurker and then getting into the whole "build something people want" thing. Over the years, I've "launched" quite a few projects here. Some have failed, while others have succeeded far beyond my modest expectations. In the pre-Product Hunt era, launching on HN was the only way to get exposure for your product. Even today, for a number of highly technical projects, HN is the best place to get the word out.
While the HN crowd has a reputation for being too cynical at times (the most famous example being the original "Show HN Dropbox"), anticipating how the HN crowd will react and what kind of criticism a project might attract has actually helped me improve the product before launch!
> I mean one day you got traffic 100K on the website. Good. But just for one day.
My latest project, Typesense, which is an open source instant search engine (https://github.com/typesense/typesense), literally found traction only after posting here on HN. Yes, it was ~50K of traffic in a single day, but it had a permanent impact on the baseline traffic. So nothing is as useless as it looks :)
Apart from the value I've gotten out of all these Show HNs, there is an incredible amount of value in the comments on HN. In fact, I often skip the main post and just skim through the comments. Also, unlike certain other forums, snarky/toxic comments are discouraged and moderated.
Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance
Project mention: Brave search engine: no tracking, profiling – may offer paid-for, no-ad version | news.ycombinator.com | 2021-03-03
Ambar: Document Search Engine
Project mention: Document Automation Software | reddit.com/r/datahorder | 2021-02-20
There is also stuff like Mayan EDMS, which is much more enterprise-oriented, or Ambar, which is targeted more at individual users.
Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.
Weaviate is a cloud-native, modular, real-time vector search engine
Project mention: V1 of the open-source Vector Search Engine Weaviate released | reddit.com/r/Database | 2021-01-19
Developer documentation: https://www.semi.technology/developers/weaviate/current/ Github: https://github.com/semi-technologies/weaviate
Lightning-fast file system indexer and search tool
Project mention: Google Books for your Personal Collection? | reddit.com/r/selfhosted | 2021-01-29
Seeks is a decentralized p2p websearch and collaborative tool.
Project mention: Startpage: The most private search engine | news.ycombinator.com | 2021-01-10
There are two that I know of:
YaCy: https://github.com/yacy/yacy_search_server (functional)
Seeks: https://github.com/beniz/seeks (defunct)
There's also SearX, which isn't distributed but is a metasearch engine (pulls results from multiple search engines) that you can self-host.
dmt engine (pc, server or small computers)
Project mention: Are there are good tools to manage/search collections of documents, saved web pages etc? | reddit.com/r/DataHoarder | 2021-01-16
Small update, went rereading this part you claimed is a bunch of nonsense: https://github.com/uniqpath/dmt/blob/main/help/ZETA_BACKGROUND.md
Local standalone HTML homepage to search in 175 search engines (duckduckgo, youtube, twitter, wikipedia, etc.)
What are some of the best open-source Search Engine projects? This list will help you: