local_adaptive_binarization
im2markup
local_adaptive_binarization | im2markup | |
---|---|---|
2 | 1 | |
124 | 1,172 | |
- | 0.2% | |
0.0 | 3.0 | |
about 1 year ago | 6 months ago | |
C++ | Lua | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
local_adaptive_binarization
-
Recovering redacted information from pixelated videos
Not off the shelf but here are some tools. I have no experience with them.
Wolf binarization - I think it makes the text more clear before OCR.
https://github.com/chriswolfvision/local_adaptive_binarizati...
This thing OCRs the pdf using Tesseract OCR
https://github.com/ocrmypdf/OCRmyPDF/
Two other pdf tools
https://github.com/qpdf/qpdf
https://github.com/pikepdf/pikepdf
-
Tesseract OCR
(2): https://github.com/chriswolfvision/local_adaptive_binarizati...
im2markup
What are some alternatives?
BoofCV - Fast computer vision library for SFM, calibration, fiducials, tracking, image processing, and more.
LaTeX-OCR - pix2tex: Using a ViT to convert images of equations into LaTeX code.
scantailor-advanced - ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.
local_adaptive_binarizati
pikepdf - A Python library for reading and writing PDF, powered by QPDF
OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
qpdf - QPDF: A content-preserving PDF document transformer
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.