Tesseract OCR

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

scantailor-advanced

21 1,103 0.0 C++

ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.

I use a £15 arm with a vice grip for my phone from Amazon, copy the files to my laptop and then run a bash for-loop of the tesseract CLI over the resultant files.
I use https://github.com/4lex4/scantailor-advanced to deskew the images and generate the PDF.
It isn't perfect but my purposes are more around research than publication, so, YMMV!
OCRmyPDF

77 11,866 9.6 Python

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

I've used tesseract directly and there definitely is some footguns when it comes to PDFs and being sure not to re-compress them and lose quality.
If you're looking to add a text layer to a PDF (for search purposes for instance) I can highly recommend https://github.com/jbarlow83/OCRmyPDF/
It uses Tesseract and works quite well for most PDFs, I made a semi-functional script before I discovered it and it would have saved a lot of hassle.
InfluxDB

www.influxdata.com
sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Mayan EDMS

34 549 0.0 Python

Free Open Source Document Management System (mirror, no pull request or issues)

This is the OCR engine used by Mayan EDMS[1] which I've used since 2018. The reliability has been topnotch.
[1] https://www.mayan-edms.com/
local_adaptive_binarization

2 124 0.0 C++

Local adaptive image binarization

(2): https://github.com/chriswolfvision/local_adaptive_binarizati...
local_adaptive_binarizati

2 - -

(2): https://github.com/chriswolfvision/local_adaptive_binarizati...
PaddleOCR

60 38,202 8.6 Python

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
EasyOCR

38 21,795 4.6 Python

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
WorkOS

workos.com
sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
BoofCV

20 1,035 8.5 Java

Fast computer vision library for SFM, calibration, fiducials, tracking, image processing, and more.

Image processing strongly depends on what image you wanna use. To find an "auto" approach, that works for every image is nearly impossible...
I once wrote a bookscanner app in Java (https://boofcv.org), where everything was done automatically (preprocessing, object detection / book extraction, skin detection / finger removal, deskewing, line-slope-correction and so on). It was very difficult to adjust the parameters, that at least most of the books looked good.
Tesseract.js

32 33,398 8.2 JavaScript

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

I used the wasm implementation and scanned 1 cereal box label. https://tesseract.projectnaptha.com/

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project