Python Search

Open-source Python projects categorized as Search

Top 23 Python Search Projects

  • algorithms

    Minimal examples of data structures and algorithms in Python

  • Searx

    Privacy-respecting metasearch engine

    Project mention: Does Firefox have the same results as Google chrome? | reddit.com/r/firefox | 2023-05-29

    Here's SearX

  • buku

    🔖 Personal mini-web in text

    Project mention: Bookmark manager buku v4.8 is released | reddit.com/r/commandline | 2023-02-18

  • searxng

    SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

    Project mention: Does Firefox have the same results as Google chrome? | reddit.com/r/firefox | 2023-05-29

    SearXNG

  • tribler

    Privacy enhanced BitTorrent client with P2P content discovery

    Project mention: Having a 20GB file that lets you ask an offline computer any question is amazing | news.ycombinator.com | 2023-05-22

    [2] The Global Brain - the roadmap, https://github.com/Tribler/tribler/issues/7064

  • txtai

    💡 Semantic search and workflows powered by language models

    Project mention: Customize your own embeddings database | reddit.com/r/programming | 2023-05-13

    Thank you. In the context of this article, txtai can get you started. It can vectorize, store and search data. You can also use an embeddings API provider, if preferred, as shown in this Tweet.

  • elasticsearch-py

    Official Python client for Elasticsearch

    Project mention: Verify Connection to Elasticsearch (2021) | news.ycombinator.com | 2023-04-12

  • elasticsearch-dsl-py

    High level Python client for Elasticsearch

    Project mention: Top Python Coding Repos | dev.to | 2022-09-05

    requests - A simple, yet elegant, HTTP library. sanic - Next generation Python web server/framework | Build fast. Run fast. click - Python composable command line interface toolkit elasticsearch-dsl-py - High level Python client for Elasticsearch panel - A high-level app and dashboarding solution for Python internetarchive - A Python and Command-Line Interface to Archive.org coconut - Simple, elegant, Pythonic functional programming

  • django-haystack

    Modular search for Django

    Project mention: Show HN: I built haystack – client-side workplace search engine for devs | news.ycombinator.com | 2023-01-02

  • marqo

    Vector search for humans.

    Project mention: Kdb.ai Vector Database | news.ycombinator.com | 2023-05-20

    Someone from Marqo here - if you're looking for an end-to-end vector search DB that handles vector search and transformation you should check out marqo. https://github.com/marqo-ai/marqo

  • image-match

    🎇 Quickly search over billions of images

    Project mention: Find visual similar photos | reddit.com/r/selfhosted | 2022-09-17

    Not a full solution but imagematch is a widely adopted algorithm for this purpose. There’s even a very nice dockerized version

  • datasketch

    MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble

  • JobFunnel

    Scrape job websites into a single spreadsheet with no duplicates.

  • paperai

    📄 🤖 Semantic search and workflows for medical/scientific papers

    Project mention: [P] Parse research papers into structured data | reddit.com/r/MachineLearning | 2023-01-28

    paperai | paperetl

  • Memacs

    What did I do on February 14th 2007? Visualize your (digital) life in Org-mode

    Project mention: Share your folder structure | reddit.com/r/datacurator | 2023-03-18

    Renaming files isn't an issue any more once you've started using file referencing methods that are not prone to changed folder paths (alternative method) or even the basic file name (by using the unchanged first part of the file name as long as it is unique among all indexed files). This way, I really don't care about broken links any more because I don't get them. But you don't get that freedom with most PIM tools except mine, I'm afraid.

  • RecoverPy

    Interactively find and recover deleted or 👉 overwritten 👈 files from your terminal

    Project mention: RecoverPy 2.0.5: Recover deleted or overwritten files from your terminal | reddit.com/r/coolgithubprojects | 2023-04-01

  • notion-search-alfred-workflow

    An Alfred workflow to search Notion with instant results

    Project mention: A shortcut to open the Notion search bar from anywhere while browsing would save a significant amount of time. What do you think | reddit.com/r/Notion | 2023-02-08

    Alfred has a Notion workflow and Raycast has a Notion extension that allow for searching Notion using a keyboard shortcut from anywhere in macOS

  • pysolr

    Pysolr — Python Solr client

  • now

    🧞 No-code tool for creating a neural search solution in minutes (by jina-ai)

    Project mention: A Guide to Using OpenTelemetry in Jina for Monitoring and Tracing Applications | dev.to | 2023-02-16

    💡In this post we’re just building out a backend, and not touching on a frontend. To build your own low-code backend+frontend neural search solution, check out Jina NOW.

  • khoj

    Search assistant for your org-mode, markdown notes, beancount transaction and images 🦅

    Project mention: Plugin for AI trained on local notes? | reddit.com/r/ObsidianMD | 2023-04-07

    May I suggest Obsidian Khoj? https://github.com/debanjum/khoj

  • stweet

    Advanced Python library to scrape Twitter (tweets, users) from unofficial API

    Project mention: Failed using the new twitter API or alternatives | reddit.com/r/learnpython | 2023-05-11

  • codequestion

    🔎 Semantic search for developers

    Project mention: Introducing the Overflow Offline project | reddit.com/r/programming | 2022-10-21

    GitHub | Article

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2023-06-01.

Index

What are some of the best open-source Search projects in Python? This list will help you:

Rank  Project                        Stars
1     algorithms                     22,597
2     Searx                          12,887
3     whoogle-search                 7,436
4     buku                           5,681
5     searxng                        4,643
6     tribler                        4,244
7     txtai                          4,107
8     elasticsearch-py               3,949
9     elasticsearch-dsl-py           3,654
10    django-haystack                3,422
11    marqo                          2,970
12    image-match                    2,861
13    datasketch                     1,969
14    JobFunnel                      1,635
15    paperai                        953
16    Memacs                         910
17    RecoverPy                      764
18    notion-search-alfred-workflow  755
19    pysolr                         639
20    now                            588
21    khoj                           479
22    stweet                         448
23    codequestion                   435