Python Search

Open-source Python projects categorized as Search | Edit details

Top 23 Python Search Projects

  • algorithms

    Minimal examples of data structures and algorithms in Python

    Project mention: algorithms VS CXXGraph - a user suggested alternative | libhunt.com/r/algorithms | 2022-02-28
  • jina

    Cloud-native neural search framework for 𝙖𝙣𝙮 kind of data

    Project mention: Released my 2nd best open-source project out of 170 projects shipped in the past 2 yrs to make AI powered search easy | reddit.com/r/opensource | 2022-03-26

    And this latest project Clip as a service has just received 10k stars on GitHub, very close to my other project Jina which has 14k stars. Clip as a service providers an easy and intuitive way to create embeddings for text and images that can be used to create semantic text and image search.

  • SonarLint

    Deliver Cleaner and Safer Code - Right in Your IDE of Choice!. SonarLint is a free and open source IDE extension that identifies and catches bugs and vulnerabilities as you code, directly in the IDE. Install from your favorite IDE marketplace today.

  • Searx

    Privacy-respecting metasearch engine

    Project mention: I'm currently using Orbot (tor) and this happened when I ran a search in Chrome. What the frick? | reddit.com/r/chrome | 2022-05-17

    That page is normal for Tor users, as Google blocks many Tor IPs. It would be better to use a Tor-friendly search engine instead.

  • buku

    :bookmark: Personal mini-web in text

    Project mention: Help me ruminate.. on bookmark managers. | reddit.com/r/selfhosted | 2022-05-18

    I personally use Buku: https://github.com/jarun/buku/ Works pretty well for me, specially with its web frontend (bukuserver).

  • Paperless-ng

    A supercharged version of paperless: scan, index and archive all your physical documents

    Project mention: DMS for history research | reddit.com/r/selfhosted | 2022-05-03

    https://www.mayan-edms.com/ or https://github.com/jonaswinkler/paperless-ng would be my recommendations.

  • tribler

    Privacy enhanced BitTorrent client with P2P content discovery

    Project mention: The United States is censoring the Internet | reddit.com/r/censorship | 2022-03-04

    The other day, I was using Hola VPN set to the United States and went to https://www.tribler.org/, only to discover that it couldn't connect. Hmm, I thought, it must be down. But the Wayback Machine says that it is up. At that point, I went to Tor, and it loaded successfully!!

  • Scout APM

    Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.

  • elasticsearch-py

    Official Elasticsearch client library for Python

    Project mention: OpenSearch – open-source search and analytics based on Apache 2.0 Elasticsearch | news.ycombinator.com | 2022-03-05

    FD: I have a friend who works at Elastic, though he doesn't really colour my opinions of things.

    > Firstly, dick moves like this: https://github.com/elastic/elasticsearch-py/pull/1623

    I understand that this is unpopular, but you can make a very strong argument that it's to prevent weird errors in the future. I'm also guilty of littering my code with Asserts to ensure the universe is working fine.

    The alternative is to allow it to work and then you end up with weird issues like when you connect mysql client to mariadb server (and vice-versa): https://stackoverflow.com/questions/50169576/mysql-8-0-11-er...

    > Secondly, I don't buy the argument from Elastic any more. Yes, the ethical thing to do when you're making money from someone's work is at least contribute back. At the same time though, they're making money from packaging it up and selling it _as a service_. That "as a service" part is where they're making the bucks.

    That's just an opinion, yes they have a service, and yes it competes with Amazon. Is it cool for Amazon to take a body of work and sell it without supporting it? Are amazon actually supporting it? Is it the same as Elastic using Lucene? (not really because Elastic submits a the majority of fixes to Lucene, but, you get it).

    it's kinda gray, I'm sure Amazon thinks they're the good guy, but it's hard for me to look at Elastic as the bad guy in all this.

  • elasticsearch-dsl-py

    High level Python client for Elasticsearch

    Project mention: Elasticsearch works but doesn't work in django - any tips? | reddit.com/r/learnpython | 2021-12-26

    I'm not being cryptic. I'm struggling to see how I can be any clearer. You only need to import things you actually use in your code. The only names from Elasticsearch that you use in your code are Search and MultiSearch, and even those are commented out. You don't need to import things that are only used by other imports. Otherwise where would it stop? Obviously the code that defines Match and MultiMatch themselves references other internal Elasticsearch things, and other libraries that are irrelevant to your code - you can see what it imports here. You don't need to import any of those, and for the same reason you don't need to import Match just because it might be referenced by the query method.

  • django-haystack

    Modular search for Django

    Project mention: django haystack: which search engine would be better | reddit.com/r/codehunter | 2022-04-20

    Which of the search engines out of these would be good to use with django haystack

  • image-match

    🎇 Quickly search over billions of images

    Project mention: Here is a demonstration of the new features for the tool I'm working on | reddit.com/r/pathofexile | 2022-02-11

    There are libraries available online like this one that will compare between images and return the closest match.

  • txtai

    💡 Build AI-powered semantic search applications

    Project mention: txtai 4.5 released - offload vectorization externally | reddit.com/r/Python | 2022-05-18
  • datasketch

    MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble

  • JobFunnel

    Scrape job websites into a single spreadsheet with no duplicates.

  • Memacs

    What did I do on February 14th 2007? Visualize your (digital) life in Org-mode

    Project mention: Cobbling together a Resonance calendar in org-mode | reddit.com/r/orgmode | 2022-04-03

    https://github.com/novoid/Memacs sounds like it might be somewhat related to your goal of making a timeline of your activity.

  • searxng

    SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

    Project mention: Lukol – A “Privacy” Search Engine | news.ycombinator.com | 2022-05-22

    Or the fork which is much more active these days https://docs.searxng.org/

    In the past it Lukol served Google Ads: https://github.com/privacytools/privacytools.io/issues/1557#...

  • paperai

    📄 🤖 AI-powered literature discovery and review engine for medical/scientific papers

    Project mention: Search on vector embeddings | reddit.com/r/LanguageTechnology | 2021-05-27

    You can also consider implementing a required token strategy for certain terms that must be in the results. See an example of that here: https://github.com/neuml/paperai/blob/master/src/python/paperai/query.py#L104 - example of a query like that would be "great +iphone tutorials"

  • pysolr

    Pysolr — Python Solr client

  • RecoverPy

    Interactively find and recover deleted or :point_right: overwritten :point_left: files from your terminal

    Project mention: Recover deleted/overwritten files with RecoverPy 1.5.0 | reddit.com/r/Python | 2022-04-26

    Hi! I recently release RecoverPy v1.5.0 and I think I might give you some news. -> Repo: https://github.com/PabloLec/RecoverPy -> What is it? RecoverPy is a 100% Python tool to not only recover deleted but also overwritten files. I got the idea when I was quite new to some programming best practices, especially version control... Long story short, I accidentally piped my output into my precious script... Just spent the day working on something and instead of typing myscript > log, I typed log > myscript, oh boy what a feeling. I knew some tools to recover deleted files, but my problem was quite different, I didn't deleted the file (in system words, marked the file blocks as deleted/available), I just replaced it's content. Talk about an impostor syndrome. After a long ride in the abysses of unix stackexchange, I found some dark combination of grep and dd command to search directly in your raw system partitions blocks and eventually recovered my file! But as the process was really slow and painful, I thought it might be a good idea to make a tool out of it. That's how RecoverPy was born.

  • Search Engine Parser

    Lightweight package to query popular search engines and scrape for result titles, links and descriptions

  • codequestion

    💭 Ask coding questions directly from the terminal

    Project mention: Build a QA database | dev.to | 2022-05-06

    For a full example, see codequestion, which is an application that matches user questions to Stack Overflow question-answer pairs.

  • tldrstory

    📊 AI-powered understanding of headlines and story text

  • cherche

    📑 Neural Search

    Project mention: [P] Cherche - allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers. | reddit.com/r/MachineLearning | 2022-01-16
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2022-05-22.

Python Search related posts

Index

What are some of the best open-source Search projects in Python? This list will help you:

Project Stars
1 algorithms 20,821
2 jina 14,601
3 Searx 11,170
4 whoogle-search 5,413
5 buku 5,086
6 Paperless-ng 4,783
7 tribler 4,053
8 elasticsearch-py 3,632
9 elasticsearch-dsl-py 3,471
10 django-haystack 3,294
11 image-match 2,732
12 txtai 2,157
13 datasketch 1,713
14 JobFunnel 1,553
15 Memacs 823
16 searxng 757
17 paperai 738
18 pysolr 616
19 RecoverPy 368
20 Search Engine Parser 325
21 codequestion 276
22 tldrstory 261
23 cherche 198
Find remote jobs at our new job board 99remotejobs.com. There are 7 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com