Python information-retrieval

Open-source Python projects categorized as information-retrieval

Top 11 Python information-retrieval Projects

  • GitHub repo EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.

    Project mention: [Question] Best approach for Optical Character recognition on large (20MB+) photos? | reddit.com/r/opencv | 2021-11-10

    Try easyocr or Tesseract. Both are pretty easy to use and don't need much background in OpenCV.

  • GitHub repo gensim

    Topic Modelling for Humans

    Project mention: Gensim – a Python library for topic modelling, document indexing | news.ycombinator.com | 2021-11-25

  • GitHub repo ranking

    Learning to Rank in TensorFlow

    Project mention: [D] learning to Rank | reddit.com/r/MachineLearning | 2021-02-21

    There are many different models and loss functions used for ranking (Tensorflow Ranking offers a bunch, probably also available for Jax / Pytorch / etc., or easily convertible).

  • GitHub repo InvoiceNet

    Deep neural network to extract intelligent information from invoice documents.

    Project mention: Pdfsandwich | news.ycombinator.com | 2021-11-06
  • GitHub repo pke

    Python Keyphrase Extraction module

    Project mention: Question on easing comprehension | dev.to | 2021-09-15
  • GitHub repo forte

    Forte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/

    Project mention: Building Modular and Re-purposable NLP Pipelines | reddit.com/r/learnmachinelearning | 2021-03-02

    Introducing Forte, from the CASL open-source project at Petuum. Forte combines multiple NLP tools to construct an entire NLP pipeline with a few lines of Python and extend them to different domains.

  • GitHub repo FreeDiscovery

    Web Service for E-Discovery Analytics

    Project mention: Non-subscription, non-cloud-based review software? | reddit.com/r/ediscovery | 2021-08-31

  • GitHub repo PatZilla

    PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.

  • GitHub repo nalcos

    Search Git commits in natural language

    Project mention: NaLCoS: Search commit messages in your repository in natural language | news.ycombinator.com | 2021-09-20
  • GitHub repo IP-Tracker

    Track any IP address with IP-Tracker. IP-Tracker is developed for Linux and Termux. You can retrieve information about any IP address using IP-Tracker.

    Project mention: Pinpoint IP address detection (in the comments) | reddit.com/r/KGBTR | 2021-03-30
  • GitHub repo BERT-QE

    Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".

    Project mention: [D] BERT-QE: Contextualized Query Expansion for Document Re-ranking (Research Paper Walkthrough) | reddit.com/r/MachineLearning | 2021-02-24

    ⏩ Paper Title: BERT-QE: Contextualized Query Expansion for Document Re-ranking
    ⏩ Paper: https://arxiv.org/pdf/2009.07258.pdf
    ⏩ Code: https://github.com/zh-zheng/BERT-QE
    ⏩ Author: Zhi Zheng, Kai Hui, Ben He, Xianpei Han, Le Sun, Andrew Yates
    ⏩ Organisation: University of Chinese Academy of Sciences, Amazon Alexa, Institute of Software, Chinese Academy of Sciences, Max Planck Institute for Informatics

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2021-11-25.
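
As a minimal sketch of the ordering described in the note, the snippet below sorts the listed projects by star count in descending order. The star counts are copied from the Index table on this page; the variable names and the use of Python's built-in sorted() are illustrative assumptions, not a description of how the site actually builds the list.

```python
# Star counts copied from the Index table on this page (deliberately shuffled
# here so the sort has something to do).
projects = [
    ("gensim", 12694),
    ("pke", 1036),
    ("EasyOCR", 13149),
    ("forte", 152),
    ("InvoiceNet", 1868),
    ("ranking", 2346),
    ("PatZilla", 58),
    ("FreeDiscovery", 62),
    ("nalcos", 48),
    ("IP-Tracker", 29),
    ("BERT-QE", 29),
]

# Sort by star count, highest first. sorted() is stable, so projects with the
# same star count (IP-Tracker and BERT-QE) keep their input order.
ranked = sorted(projects, key=lambda item: item[1], reverse=True)

# Print a numbered listing: rank, project name, star count.
for rank, (name, stars) in enumerate(ranked, start=1):
    print(f"{rank:>2} {name:<14} {stars:>6,}")
```

Because the sort is stable, how ties are broken depends only on the input order; the page does not say which tie-breaking rule it uses.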

Index

What are some of the best open-source information-retrieval projects in Python? This list will help you:

#   Project        Stars
1   EasyOCR        13,149
2   gensim         12,694
3   ranking         2,346
4   InvoiceNet      1,868
5   pke             1,036
6   forte             152
7   FreeDiscovery      62
8   PatZilla           58
9   nalcos             48
10  IP-Tracker         29
11  BERT-QE            29