Search Engines

Open-source projects categorized as Search Engines

Top 16 Search Engine Open-Source Projects

  • the-book-of-secret-knowledge

    A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

  • Project mention: Cyber Security iPhone Application Idea | /r/iOSDevelopment | 2023-07-03

    8. Security Knowledge Base: - Utilize resources like The-book-of-secret-knowledge (e.g., https://github.com/trimstray/the-book-of-secret-knowledge) and Awesome-Hacking (e.g., https://github.com/Hack-with-Github/Awesome-Hacking) to build a knowledge base. - Extract relevant security information and create a structured knowledge base within SecurIoT. - Implement functionality to query and retrieve security information from the knowledge base. - Thoroughly test the knowledge base integration, ensuring accurate retrieval of security knowledge.

  • MeiliSearch

    A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

  • Project mention: Publish/Subscribe with Sidekiq | dev.to | 2024-02-21

    We needed to introduce a new service for search. As we settled on using meilisearch, we needed a way to sync updates on our models with the records in meilisearch. We could've continued to use callbacks but we needed something better.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Typesense

    Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

  • Project mention: Website Search Hurts My Feelings | news.ycombinator.com | 2023-12-26

    There are actually plenty of non-ES products that are way easier to integrate and tune (and get better results with less effort).

    - Typesense (https://github.com/typesense/typesense)

    - Algolia

    - Google Programmable Search Engine (https://programmablesearchengine.google.com/about/)

  • qdrant

    Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

  • Project mention: Ask HN: Has Anyone Trained a personal LLM using their personal notes? | news.ycombinator.com | 2024-04-03

    I'm currently looking to implement locally, using QDrant [1] for instance.

    I'm just playing around, but it makes sense to have a runnable example for our users at work too :) [2].

    [1]. https://qdrant.tech/

  • Yacy

    Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance

  • Project mention: New ways we're tackling spammy, low-quality content on Search | news.ycombinator.com | 2024-03-07
  • Gigablast

    Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.

  • OnionSearch

    OnionSearch is a script that scrapes urls on different .onion search engines.

  • Project mention: Launching Osint Industries: Discover Your Digital Footprint in Realtime | news.ycombinator.com | 2023-08-09

    Greetings, HN community. We are excited to share OSINT Industries, a platform dedicated to real-time open-source intelligence (OSINT) pertaining to phone numbers and emails.

    About OSINT Industries:

    Realtime Analysis: We provide an up-to-the-moment enrichment tool for emails, and phone numbers.

    Real-Time Intelligence: We refrain from using databases. Every piece of data is fetched in real-time, ensuring its accuracy and timeliness. None of the queries or results are stored.

    Extensive Reach: Our tool can identify associated accounts linked to a particular email or phone number from over 200 websites.

    Detailed Insights: Beyond basic association, our system can pull additional data points, such as images, map locations, and more.

    Pedigree: Our foundation is built upon proven tools our team made in the past like Holehe (https://github.com/megadose/holehe), GHunt (https://github.com/mxrch/GHunt), and onionsearch (https://github.com/megadose/OnionSearch).

    User Base: Within 3 months of our inception, we've got over 350k registered users.

    Trust & Reliability: Our tool has been integrated by various global law enforcement agencies, showcasing its reliability and utility.

    Try the tool for free to discover the digital footprint of your email and phone number. The first 5 searches are free: https://osint.industries

    We offer API access to enterprises, if you're interested in that contact me on [email protected].

    As our tool deals with data that some may view as sensitive, I think it is also important to link our policies here which govern our ethics, and data processing.

    Trust & Safety (our ethics): https://osint.industries/trust

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • sist2

    Lightning-fast file system indexer and search tool

  • Project mention: Better option then filebrowser to share files | /r/OpenMediaVault | 2023-06-11

    Quickly Googling for a docker indexer and search app I turned up Sist2, that on the surface looks like might fit your needs. I don't have an appropriate data store to run it against, so I can't speak to its indexing speed or efficacy. However, the developer does have an accessible demo to try, and the front end at least appears to function well.

  • domains

    World’s single largest Internet domains dataset

  • Project mention: There are only 2 .yahoo Internet domains | news.ycombinator.com | 2023-06-13
  • dark-web-osint-tools

    OSINT Tools for the Dark Web

  • Nuclia DB

    NucliaDB, The AI Search database for RAG

  • Project mention: Tantivy 0.20 is released: Schemaless column store, Schemaless aggregations, Phrase prefix queries, Percentiles, and more... | /r/rust | 2023-06-20

    You have also NucliaDB that is built on top of tantivy and addresses vector search for documents and video search.

  • SmartImage

    Reverse image search tool (SauceNao, IQDB, Ascii2D, trace.moe, and more)

  • tinyvector

    A tiny embedding database in pure Rust.

  • Project mention: Tinyvector - a tiny embedding database in pure Rust | /r/aiengineer | 2023-07-11
  • Seeks

    Seeks is a decentralized p2p websearch and collaborative tool.

  • artadosearch

    Artado Search is open source, private and highly customizable search engine

  • Project mention: Top Google Search Alternatives | /r/Alt0 | 2023-04-23

    Artado

  • multiSearchHome

    :mag_right: Local standalone html homepage to search in 175 search engine (duckduckgo, youtube, twitter, wikipedia, etc..) // FR___: Page d'accueil html autonome, pour chercher dans 175 moteurs de recherche.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-04-03.

Search Engines related posts

Index

What are some of the best open-source Search Engine projects? This list will help you:

Project Stars
1 the-book-of-secret-knowledge 128,453
2 MeiliSearch 43,043
3 Typesense 17,796
4 qdrant 17,718
5 Yacy 3,244
6 Gigablast 1,515
7 OnionSearch 1,096
8 sist2 755
9 domains 634
10 dark-web-osint-tools 614
11 Nuclia DB 569
12 SmartImage 510
13 tinyvector 335
14 Seeks 262
15 artadosearch 148
16 multiSearchHome 4
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com