Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work. Learn more →
Top 23 Python OCR Projects
-
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Project mention: Donut: OCR-Free Document Understanding Transformer | news.ycombinator.com | 2023-05-29When I was evaluating options a few months ago I found https://github.com/PaddlePaddle/PaddleOCR to be a very strong contender for my use case (reading product labels), but you'll definitely want to put together some representative docs/images and test a bunch of solutions to see what works for you.
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Project mention: Donut: OCR-Free Document Understanding Transformer | news.ycombinator.com | 2023-05-29The main one was https://github.com/JaidedAI/EasyOCR, mostly because, as promised, it was pretty easy to use, and uses pytorch (which I preferred in case I wanted to tweak it). It has been updated since, but at the time it was using CRNN, which is a solid model, especially for the time - it wasn't (academic) SOTA but not far behind that. I'm sure I could've coaxed better performance than I got out of it with some retraining and hyperparameter tuning.
-
CodiumAI
TestGPT | Generating meaningful tests for busy devs. Get non-trivial tests (and trivial, too!) suggested right inside your IDE, so you can code smart, create more value, and stay confident when you push.
-
Project mention: Donut: OCR-Free Document Understanding Transformer | news.ycombinator.com | 2023-05-29
-
Project mention: What's the BEST way to detect these letters on an image? | reddit.com/r/learnpython | 2023-03-05
If you don't have it already: https://github.com/madmaze/pytesseract
-
Project mention: Github packages/Apps that are must have for Physicists using Linux | reddit.com/r/AskPhysics | 2023-04-16
I have recently discovered a few very helpful github packages which help me make notes while listening to lectures. These would be 1. pix2tex (allows you to scan an equation and convert it to latex) 2. pix2text (allows you to scan an equation with words in it and converts it to latex and text) 3. Tesseract (not really a physics related package, but it does allow me to copy notes from transcripts easily) 4. Mathpix an app that performs all the above mentioned operations better than the packages above, but one which ain't free.
-
I built my own solution using a combination of Tesseract and OpenCV (in python). But even though the source PDF content is computer generated, I still get sporadic OCR errors. After writing my solution, I came across this https://github.com/Layout-Parser/layout-parser which might be a better starting point for dealing with PDFs but I haven't tried it yet.
-
Project mention: MMDeploy: Deploy All the Algorithms of OpenMMLab | reddit.com/r/u_Allent_pjlab | 2022-11-21
MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.
-
InfluxDB
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
-
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Project mention: FOSS self-hosted image-to-text gpu accelerated object recognition ? Is there anything on the table yet ? | reddit.com/r/selfhosted | 2023-03-30https://github.com/amusi/awesome-object-detection https://mmdetection.readthedocs.io/en/latest/index.html https://github.com/thtrieu/darkflow https://github.com/OlafenwaMoses/ImageAI https://github.com/dmlc/gluon-cv https://github.com/aim-uofa/AdelaiDet/ https://github.com/aim-uofa/AdelaiDet/blob/master/configs/FCOS-Detection/README.md https://github.com/wizyoung/YOLOv3_TensorFlow
-
video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
Can someone tell me if this is a virus and in case how to clean it? https://github.com/YaoFANGUK/video-subtitle-extractor/blob/main/README_en.md I know it's not a things totally connected to piracy but idk who else can know this, if anyone can control it thx
-
-
-
CnOCR
CnOCR: Awesome Chinese/English OCR toolkits based on PyTorch/MXNet, It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】
Project mention: Using multiple ocr for better recognition [D] | reddit.com/r/MachineLearning | 2022-12-14One of tools is CnOCR .
-
pdftabextract
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
-
I think I know where the problem may be. Please open a ticket in github and I will fix it (I need 2-3 days for that)
-
donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 (by clovaai)
Project mention: Any way to convert my handwritten diary to searchable PDFs? | reddit.com/r/linuxquestions | 2023-05-27I just saw this on hackernews. Maybe that’s an option?
-
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Last I checked I saw a grocery bill example using https://github.com/mindee/doctr and was fairly accurate. Bear in mind that was last year, hopefully it got even better or there are other libraries
-
Did you read the instructions for windows? https://github.com/sirfz/tesserocr
-
Snipper source code from textshot
-
Project mention: DeepDoctection: Document extraction and analysis using deep learning models | reddit.com/r/programming | 2023-04-27
-
keras-ocr
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
-
Project mention: Do you guys know where I can read the translated version of Isekai Joshi Kangoku? | reddit.com/r/shoujoai | 2023-05-16
-
BallonsTranslator
深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga translation tool powered by deeplearning
BallonTranslator https://github.com/dmMaze/BallonsTranslator
-
-
ONLYOFFICE
ONLYOFFICE Docs — document collaboration in your environment. Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises
Python OCR related posts
- Donut: OCR-Free Document Understanding Transformer
- Donut: OCR-Free Document Understanding Transformer
- Donut: OCR-Free Document Understanding Transformer
- Donut: OCR-Free Document Understanding Transformer
- Ope Source Document Understanding Library using Transformers instead of OCR
- Generative AI for Document Understanding
- massive crop and OCR newspaper
-
A note from our sponsor - Sonar
www.sonarsource.com | 30 May 2023
Index
What are some of the best open-source OCR projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | PaddleOCR | 30,627 |
2 | EasyOCR | 18,097 |
3 | OCRmyPDF | 8,920 |
4 | pytesseract | 4,846 |
5 | LaTeX-OCR | 3,900 |
6 | layout-parser | 3,668 |
7 | mmocr | 3,419 |
8 | AdelaiDet | 3,147 |
9 | video-subtitle-extractor | 3,003 |
10 | TextRecognitionDataGenerator | 2,676 |
11 | CRAFT-pytorch | 2,612 |
12 | CnOCR | 2,149 |
13 | pdftabextract | 2,059 |
14 | Papermerge | 1,973 |
15 | donut | 1,948 |
16 | doctr | 1,829 |
17 | tesserocr | 1,790 |
18 | textshot | 1,568 |
19 | deepdoctection | 1,447 |
20 | keras-ocr | 1,184 |
21 | manga-image-translator | 1,081 |
22 | BallonsTranslator | 1,039 |
23 | normcap | 989 |