gImageReader
tesseract
Our great sponsors
gImageReader | tesseract | |
---|---|---|
15 | 8 | |
1,519 | 2,821 | |
- | 2.6% | |
7.8 | 8.5 | |
29 days ago | 3 months ago | |
C++ | C++ | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gImageReader
-
Making an archive out of my grandfather's writings. What OCR scanning and doc mgt system to use?
On tesseract base here is a software to make a scan a text searchable pdf. It take a bit of time and can be a bit tedious but it does the work! https://github.com/manisandro/gImageReader/releases It does not work well on cursive writing of course. It's a bit less heavy code sided solution. Good luck!
- Is there free software for windows that can read scanned handwriting and turn it into text?
- أحمل برنامج صخر منين؟ دورت عليه كتير مش لاقياه؟ ولو مش موجود حد يعرف أي بديل كويس بيعمل Arabic OCR؟
-
Writer - Tips to remove breaks and hyphenations from PDF to DOC conversion?
I'm working with old newspaper PDFs to convert them into DOC formats. I'm having a great time with gImageReader by highlighting columns and converting them to plain text. Then I take that plain text into Libreoffice Writer (7.0.4.2) to clean up and save. If this were a book as opposed to a newspaper with ads and columns, it would have bee a lot easier to convert and format.
-
Best OCR software for extracting pdf to txt - Paid or Free version.
It would help to know a bit more of your usecase. If you're looking to just extract the text (ie, take all the textual content of your PDF and drop it into a separate text document), there are solutions like ABBYY Finereader and gImageReader. If you're looking to make PDFs searchable (keeping the scanned pages, but adding a text layer underneath so you can search and copy from them), there's NAPS2 (which has an additional command line tool for automation) and OCRmyPDF.
-
Help plz! Tool to enhance pdf text quality?
OpenSource OCR... for desktop users I like "gImageReader" URL: https://github.com/manisandro/gImageReader (Technically is GUI for tessaract)
-
Good Open Source OCR software
gImageReader is the linux standard that I'm aware of. It's a GUI to Tessaeract, but IIRC you can use other models if you have them.
-
What Are The Best Linux Apps?
gImageReader as a simple OCR application
-
OCR Arabic screenshot clipboard captures for Mac
https://github.com/manisandro/gImageReader ^^ seems like it has installers for different OS's
- Is there a good/accurate OCR/Text to Image program available?
tesseract
- OCR software that works?
- Ausschnitt aus der Paderborner Lokalzeitung von 1797. Wie genau nennt sich die Schriftart die damals benutzt wurde?
-
ocrmypdf / file not found error?
Hello - after installing tesseract on windows from here: https://github.com/UB-Mannheim/tesseract/wiki the script now works fine
-
OCR in Windows suggestions?
Plamola's "OCR-Joplin-Notes" extension seems to require the "rest-uploader" from cerealkella, which requires Tesseract, and that itself seems to require Poppler.
-
What useful unknown website do you wish more people knew about?
Windows binaries
-
I Tried Creating My Own OCR Translator Tool Using Python and Tesseract
Requirement: tesseract
- TesseractOCR 5 alpha
-
Great OCR failure of 2021, or why normal users don't bother with GNU, GPL and other free, open source software.
The Tesseract GitHub releases page points here for Windows installers of 4: https://github.com/UB-Mannheim/tesseract/wiki
What are some alternatives?
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
crow-translate - A simple and lightweight translator that allows you to translate and speak text using Google, Yandex Bing, LibreTranslate and Lingva.
tesseract-ocr - Tesseract Open Source OCR Engine (main repository)
Chocolatey - Chocolatey - the package manager for Windows
docker-teedy - Multi-architecture Dockerfile for Teedy (formerly Sismics Docs)
dpscreenocr - Program to recognize text on screen
percollate - A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
Screen-Translate - A Screen Translator/OCR Translator made by using Python and Tesseract, the user interface are made using Tkinter. All code written in python.
webapp-manager
edenai-python - The best AI engines in one API: vision, text, speech, translation, OCR, machine learning, etc. SDK and examples for Python developers.
warpinator - Share files across the LAN
OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched