Nothing off the shelf, but here are some tools. I have no experience with any of them.
Wolf binarization - I think it makes the text clearer before OCR.
https://github.com/chriswolfvision/local_adaptive_binarizati...
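To give a rough idea of what this kind of binarization does, here is a minimal pure-Python sketch of a Wolf-Jolion style local threshold. It is not the code from the linked repository. The function name `wolf_binarize`, the 3x3 default window and `k = 0.5` are assumptions chosen for illustration, and the threshold formula `T = m - k * (1 - s/R) * (m - M)` is written from memory of the published method, so check it against the repository before relying on it.

```python
import math


def wolf_binarize(gray, window=3, k=0.5):
    """Binarize a grayscale image given as a list of rows of ints (0..255).

    Returns a list of rows containing 0 (ink) or 255 (background).
    Hypothetical helper for illustration only, not the linked project's API.
    """
    height, width = len(gray), len(gray[0])
    radius = window // 2

    # Local mean and standard deviation around every pixel,
    # with the window clipped at the image borders.
    means = [[0.0] * width for _ in range(height)]
    stds = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            values = [
                gray[j][i]
                for j in range(max(0, y - radius), min(height, y + radius + 1))
                for i in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            mean = sum(values) / len(values)
            variance = sum((v - mean) ** 2 for v in values) / len(values)
            means[y][x] = mean
            stds[y][x] = math.sqrt(variance)

    min_gray = min(min(row) for row in gray)      # M: darkest pixel in the image
    max_std = max(max(row) for row in stds)       # R: largest local deviation

    # A completely flat image has no text to separate; treat it as background.
    if max_std == 0:
        return [[255] * width for _ in range(height)]

    result = []
    for y in range(height):
        row = []
        for x in range(width):
            m, s = means[y][x], stds[y][x]
            threshold = m - k * (1 - s / max_std) * (m - min_gray)
            row.append(0 if gray[y][x] < threshold else 255)
        result.append(row)
    return result
```

For example, passing a 5x5 image that is all 200 except for a single pixel of 20 in the middle should mark only that middle pixel as ink. A real scan would need a much larger window than 3x3 and a faster implementation than these nested loops.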
OCRmyPDF - this one OCRs the PDF using Tesseract OCR.
https://github.com/ocrmypdf/OCRmyPDF/
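As a rough sketch of how OCRmyPDF could be driven from a script, the snippet below builds the command line and runs it with `subprocess`. It assumes the `ocrmypdf` executable is installed and on `PATH`. The helper names `build_ocrmypdf_command` and `run_ocr` are hypothetical, and the choice of options (`--language`, `--deskew`, `--skip-text`) is only one plausible default, not a recommendation from the project.

```python
import shutil
import subprocess


def build_ocrmypdf_command(input_pdf, output_pdf, language="eng",
                           deskew=True, skip_text=True):
    """Return the argv list for an ocrmypdf run (hypothetical helper)."""
    command = ["ocrmypdf", "--language", language]
    if deskew:
        # Straighten pages that were scanned at a slight angle.
        command.append("--deskew")
    if skip_text:
        # Leave pages that already contain a text layer untouched.
        command.append("--skip-text")
    command += [input_pdf, output_pdf]
    return command


def run_ocr(input_pdf, output_pdf, **options):
    """Run ocrmypdf and raise if it is missing or exits with an error."""
    if shutil.which("ocrmypdf") is None:
        raise RuntimeError("ocrmypdf was not found on PATH")
    subprocess.run(build_ocrmypdf_command(input_pdf, output_pdf, **options),
                   check=True)
```

Calling `run_ocr("scan.pdf", "scan_ocr.pdf")` (both file names are placeholders) would then write a searchable copy of the scan, assuming Tesseract and the requested language data are installed.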
Two other PDF tools:
https://github.com/qpdf/qpdf
https://github.com/pikepdf/pikepdf