Python OCR

Open-source Python projects categorized as OCR

Top 23 Python OCR Projects

  • PaddleOCR

    Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

    Project mention: Donut: OCR-Free Document Understanding Transformer | news.ycombinator.com | 2023-05-29

    When I was evaluating options a few months ago I found https://github.com/PaddlePaddle/PaddleOCR to be a very strong contender for my use case (reading product labels), but you'll definitely want to put together some representative docs/images and test a bunch of solutions to see what works for you.

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

    Project mention: Donut: OCR-Free Document Understanding Transformer | news.ycombinator.com | 2023-05-29

    The main one was https://github.com/JaidedAI/EasyOCR, mostly because, as promised, it was pretty easy to use, and uses pytorch (which I preferred in case I wanted to tweak it). It has been updated since, but at the time it was using CRNN, which is a solid model, especially for the time - it wasn't (academic) SOTA but not far behind that. I'm sure I could've coaxed better performance than I got out of it with some retraining and hyperparameter tuning.

  • CodiumAI

    TestGPT | Generating meaningful tests for busy devs. Get non-trivial tests (and trivial, too!) suggested right inside your IDE, so you can code smart, create more value, and stay confident when you push.

  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

    Project mention: Donut: OCR-Free Document Understanding Transformer | news.ycombinator.com | 2023-05-29
  • pytesseract

    A Python wrapper for Google Tesseract

    Project mention: What's the BEST way to detect these letters on an image? | reddit.com/r/learnpython | 2023-03-05

    If you don't have it already: https://github.com/madmaze/pytesseract

  • LaTeX-OCR

    pix2tex: Using a ViT to convert images of equations into LaTeX code.

    Project mention: Github packages/Apps that are must have for Physicists using Linux | reddit.com/r/AskPhysics | 2023-04-16

    I have recently discovered a few very helpful github packages which help me make notes while listening to lectures. These would be 1. pix2tex (allows you to scan an equation and convert it to latex) 2. pix2text (allows you to scan an equation with words in it and converts it to latex and text) 3. Tesseract (not really a physics related package, but it does allow me to copy notes from transcripts easily) 4. Mathpix an app that performs all the above mentioned operations better than the packages above, but one which ain't free.

  • layout-parser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

    Project mention: Crates for converting PDF's into Markdown | reddit.com/r/rust | 2023-01-06

    I built my own solution using a combination of Tesseract and OpenCV (in python). But even though the source PDF content is computer generated, I still get sporadic OCR errors. After writing my solution, I came across this https://github.com/Layout-Parser/layout-parser which might be a better starting point for dealing with PDFs but I haven't tried it yet.

  • mmocr

    OpenMMLab Text Detection, Recognition and Understanding Toolbox

    Project mention: MMDeploy: Deploy All the Algorithms of OpenMMLab | reddit.com/r/u_Allent_pjlab | 2022-11-21

    MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.

  • InfluxDB

    Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.

  • AdelaiDet

    AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

    Project mention: FOSS self-hosted image-to-text gpu accelerated object recognition ? Is there anything on the table yet ? | reddit.com/r/selfhosted | 2023-03-30

    https://github.com/amusi/awesome-object-detection https://mmdetection.readthedocs.io/en/latest/index.html https://github.com/thtrieu/darkflow https://github.com/OlafenwaMoses/ImageAI https://github.com/dmlc/gluon-cv https://github.com/aim-uofa/AdelaiDet/ https://github.com/aim-uofa/AdelaiDet/blob/master/configs/FCOS-Detection/README.md https://github.com/wizyoung/YOLOv3_TensorFlow

  • video-subtitle-extractor

    视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

    Project mention: possible virus? | reddit.com/r/Piracy | 2023-02-07

    Can someone tell me if this is a virus and in case how to clean it? https://github.com/YaoFANGUK/video-subtitle-extractor/blob/main/README_en.md I know it's not a things totally connected to piracy but idk who else can know this, if anyone can control it thx

  • TextRecognitionDataGenerator

    A synthetic data generator for text recognition

  • CRAFT-pytorch

    Official implementation of Character Region Awareness for Text Detection (CRAFT)

  • CnOCR

    CnOCR: Awesome Chinese/English OCR toolkits based on PyTorch/MXNet, It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】

    Project mention: Using multiple ocr for better recognition [D] | reddit.com/r/MachineLearning | 2022-12-14

    One of tools is CnOCR .

  • pdftabextract

    A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

  • Papermerge

    Open Source Document Management System for Digital Archives (Scanned Documents)

    Project mention: Simplified Docker Image | reddit.com/r/Papermerge | 2023-03-15

    I think I know where the problem may be. Please open a ticket in github and I will fix it (I need 2-3 days for that)

  • donut

    Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 (by clovaai)

    Project mention: Any way to convert my handwritten diary to searchable PDFs? | reddit.com/r/linuxquestions | 2023-05-27

    I just saw this on hackernews. Maybe that’s an option?

  • doctr

    docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

    Project mention: DeepDoctection | news.ycombinator.com | 2023-04-26

    Last I checked I saw a grocery bill example using https://github.com/mindee/doctr and was fairly accurate. Bear in mind that was last year, hopefully it got even better or there are other libraries

  • tesserocr

    A Python wrapper for the tesseract-ocr API

    Project mention: Tesserocr | reddit.com/r/pycharm | 2023-01-25

    Did you read the instructions for windows? https://github.com/sirfz/tesserocr

  • textshot

    Python tool for grabbing text via screenshot

    Project mention: Language-Translation-App-using-CHATGPT | dev.to | 2023-01-26

    Snipper source code from textshot

  • deepdoctection

    A Repo For Document AI

    Project mention: DeepDoctection: Document extraction and analysis using deep learning models | reddit.com/r/programming | 2023-04-27
  • keras-ocr

    A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.

    Project mention: FLaNK Stack Weekly for 30 April 2023 | dev.to | 2023-04-30
  • manga-image-translator

    Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/

    Project mention: Do you guys know where I can read the translated version of Isekai Joshi Kangoku? | reddit.com/r/shoujoai | 2023-05-16
  • BallonsTranslator

    深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga translation tool powered by deeplearning

    Project mention: Ch. 57 English Translation | reddit.com/r/InterspeciesReviewers | 2023-04-29

    BallonTranslator https://github.com/dmMaze/BallonsTranslator

  • normcap

    OCR powered screen-capture tool to capture information instead of images

    Project mention: NormCap: OCR powered screen-capture tool | news.ycombinator.com | 2023-04-27

    https://github.com/dynobo/normcap#similar-open-source-tools

  • ONLYOFFICE

    ONLYOFFICE Docs — document collaboration in your environment. Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-05-29.

Python OCR related posts

Index

What are some of the best open-source OCR projects in Python? This list will help you:

Project Stars
1 PaddleOCR 30,627
2 EasyOCR 18,097
3 OCRmyPDF 8,920
4 pytesseract 4,846
5 LaTeX-OCR 3,900
6 layout-parser 3,668
7 mmocr 3,419
8 AdelaiDet 3,147
9 video-subtitle-extractor 3,003
10 TextRecognitionDataGenerator 2,676
11 CRAFT-pytorch 2,612
12 CnOCR 2,149
13 pdftabextract 2,059
14 Papermerge 1,973
15 donut 1,948
16 doctr 1,829
17 tesserocr 1,790
18 textshot 1,568
19 deepdoctection 1,447
20 keras-ocr 1,184
21 manga-image-translator 1,081
22 BallonsTranslator 1,039
23 normcap 989
Write Clean Python Code. Always.
Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
www.sonarsource.com