local_adaptive_binarization
EasyOCR
local_adaptive_binarization | EasyOCR | |
---|---|---|
2 | 38 | |
124 | 21,953 | |
- | 1.5% | |
0.0 | 3.6 | |
about 1 year ago | about 1 month ago | |
C++ | Python | |
- | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
local_adaptive_binarization
-
Recovering redacted information from pixelated videos
Not off the shelf but here are some tools. I have no experience with them.
Wolf binarization - I think it makes the text more clear before OCR.
https://github.com/chriswolfvision/local_adaptive_binarizati...
This thing OCRs the pdf using Tesseract OCR
https://github.com/ocrmypdf/OCRmyPDF/
Two other pdf tools
https://github.com/qpdf/qpdf
https://github.com/pikepdf/pikepdf
-
Tesseract OCR
(2): https://github.com/chriswolfvision/local_adaptive_binarizati...
EasyOCR
-
Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide
PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
- OCR a lot of hand written invoice and records?
-
[P] EasyOCR in C++!
I just uploaded my C++ implementation of EasyOCR, a well known ocr library for python. Also dusted some cobwebbs from some audio related projects as well, feel free to leave feedback or contribute! I only implemented the most salient parts, so certainly could use some community help! Cheers!
-
OCR at Edge on Cloudflare Constellation
EasyOCR is a popular project if you are in an environment where you can use run Python and PyTorch (https://github.com/JaidedAI/EasyOCR). Other open source projects of note are PaddleOCR (https://github.com/PaddlePaddle/PaddleOCR) and docTR (https://github.com/mindee/doctr).
-
Donut: OCR-Free Document Understanding Transformer
The main one was https://github.com/JaidedAI/EasyOCR, mostly because, as promised, it was pretty easy to use, and uses pytorch (which I preferred in case I wanted to tweak it). It has been updated since, but at the time it was using CRNN, which is a solid model, especially for the time - it wasn't (academic) SOTA but not far behind that. I'm sure I could've coaxed better performance than I got out of it with some retraining and hyperparameter tuning.
-
Help with OCR of pixel-y numbers
Anyways, you can give a shot to EasyOCR, pretty solid and flexible
- How to perform document OCR?
-
Python unexpectedly quits (macOS ventura, M1)
The easyocr library: https://github.com/JaidedAI/EasyOCR
- I made a website for a friend who owns a restaurant. He's wondering if there's a way to upload a picture of his menu daily. What is the best way to do this?
-
Raspberry Pi Easyocr
Not used it on a Pi but maybe a Docker version (if there is one) would run? Compose file here
What are some alternatives?
BoofCV - Fast computer vision library for SFM, calibration, fiducials, tracking, image processing, and more.
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
scantailor-advanced - ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.
tesseract-ocr - Tesseract Open Source OCR Engine (main repository)
pikepdf - A Python library for reading and writing PDF, powered by QPDF
doctr - docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
OpenCV - Open Source Computer Vision Library
awesome-colab-notebooks - Collection of google colaboratory notebooks for fast and easy experiments
im2markup - Neural model for converting Image-to-Markup (by Yuntian Deng yuntiandeng.com)
tesserocr - A Python wrapper for the tesseract-ocr API