For my second attempt I tried Teedy, using an ARM-compatible image (https://github.com/jdreinhardt/docker-teedy instead of https://github.com/sismics/docs). For this I had to convert my TIFs to PNGs, and I'd guess that Teedy might use Tesseract for OCR as well. Unfortunately, I don't think Teedy can export the OCR'd files as a PDF with selectable text.
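The TIF-to-PNG step mentioned above could be scripted roughly like this. It is only a sketch, not how Teedy or the poster did it: it assumes the third-party Pillow library is installed, and the function names `png_target` and `convert_tifs` and the directory names are made up for illustration.

```python
from __future__ import annotations

from pathlib import Path


def png_target(tif_path: Path, out_dir: Path) -> Path:
    """Map e.g. scans/page1.tif to out_dir/page1.png (hypothetical helper)."""
    return out_dir / (tif_path.stem + ".png")


def convert_tifs(src_dir: Path, out_dir: Path) -> list[Path]:
    """Convert every .tif/.tiff file in src_dir to a PNG in out_dir."""
    # Assumption: Pillow is installed (pip install Pillow). It is imported
    # here so the path helper above works without it.
    from PIL import Image

    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for tif in sorted(src_dir.iterdir()):
        if tif.suffix.lower() not in {".tif", ".tiff"}:
            continue
        target = png_target(tif, out_dir)
        with Image.open(tif) as img:
            # Note: for a multi-page TIFF this saves only the first page.
            img.save(target, format="PNG")
        written.append(target)
    return written


if __name__ == "__main__":
    # Hypothetical directory names; adjust to wherever the scans live.
    for path in convert_tifs(Path("scans"), Path("scans_png")):
        print(path)
```

PNG is lossless, so the conversion itself should not degrade the scans before OCR.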
For my first attempt I used gImageReader (https://github.com/manisandro/gImageReader). I scanned the papers to TIF files, and I'm wondering if scanning to PDF or another format would be better.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- Making an archive out of my grandfather's writings. What OCR scanning and doc mgt system to use?
- Is there free software for windows that can read scanned handwriting and turn it into text?
- Where can I download the Sakhr program? I've searched a lot and can't find it. And if it's not available, does anyone know a good alternative that does Arabic OCR?
- Writer - Tips to remove breaks and hyphenations from PDF to DOC conversion?
- Help plz! Tool to enhance pdf text quality?