ITK
tesseract-ocr
Metric | ITK | tesseract-ocr
---|---|---
Mentions | 2 | 60
Stars | 1,014 | 45,209
Growth | 2.8% | 1.5%
Activity | 9.8 | 9.5
Last commit | 4 days ago | 5 days ago
Language | C++ | C++
License | Apache License 2.0 | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ITK
-
Ask HN: What is a cool technology to learn?
Yeah, Prolog is pretty cool!
Another technology I found interesting to learn is ITK (https://itk.org/). Using ITK requires a different mindset than other image processing libraries.
Lisp is cool as well.
And fully homomorphic encryption.
-
Give me a starting nudge: microscopy image processing in python
VTK is a powerful visualization package, but it's more for working with 3D FEM/CFD data. There are lots of things in there though, so it may be useful. The sister project ParaView is an application which can be used to work with data interactively. Both have great Python support. There is also ITK, which is focused on image data, like medical scans. I have never used it, though.
tesseract-ocr
-
Automation for data entry
If your order forms are photos or faxes, you could use an OCR tool like Tesseract (https://github.com/tesseract-ocr/tesseract) to convert the images to text, so that you can then parse the text and import it into Google Sheets.
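The parsing step mentioned above could look roughly like the sketch below. It assumes the OCR text contains "Field: value" lines, which is a hypothetical form layout, and it writes CSV text, a format that Google Sheets can import. The function names parse_order_lines and rows_to_csv are made up for illustration.

```python
import csv
import io


def parse_order_lines(ocr_text: str) -> list[dict[str, str]]:
    """Collect 'Field: value' pairs from OCR text (hypothetical form layout)."""
    rows = []
    for line in ocr_text.splitlines():
        if ":" not in line:
            continue  # skip lines that do not look like a form field
        field, value = line.split(":", 1)
        field, value = field.strip(), value.strip()
        if field and value:
            rows.append({"field": field, "value": value})
    return rows


def rows_to_csv(rows: list[dict[str, str]]) -> str:
    """Serialize the parsed rows as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["field", "value"], lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Real OCR output is noisy, so a production parser would likely need fuzzier matching than a plain split on the colon.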
-
Handwriting to text without sending.
Then it uses tesseract to analyze each image, extracting the text.
-
ocr - select screen portion and recognize text from non text source such as videos
Here is a small, unspectacular script to read text from the screen. It uses tesseract to OCR the text and the import command from ImageMagick to take a screenshot. The script then outputs the recognized text to stdout. You could replace the screenshot tool with something else you like, but the script expects the created files.
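The script from the post is not reproduced here. As a rough sketch of the same idea in Python, assuming both ImageMagick's import and tesseract are on the PATH and an X11 session is available, the two steps could be chained like this. All function names are hypothetical.

```python
import subprocess
import tempfile
from pathlib import Path


def screenshot_command(target: Path) -> list[str]:
    """ImageMagick's import: the user selects a screen region with the mouse."""
    return ["import", str(target)]


def ocr_command(image: Path, lang: str = "eng") -> list[str]:
    """Passing 'stdout' as the output base makes tesseract print the text."""
    return ["tesseract", str(image), "stdout", "-l", lang]


def read_text_from_screen(lang: str = "eng") -> str:
    """Capture a screen selection to a temporary file and return its OCR text."""
    with tempfile.TemporaryDirectory() as tmp:
        shot = Path(tmp) / "selection.png"
        subprocess.run(screenshot_command(shot), check=True)
        result = subprocess.run(
            ocr_command(shot, lang), check=True, capture_output=True, text=True
        )
        return result.stdout
```

Calling print(read_text_from_screen(), end="") would write the recognized text to stdout, as the original script does. Swapping the screenshot tool only requires changing screenshot_command.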
-
scan -> file name from OCR
The only open source OCR tool I know of is Tesseract.
-
Looking for advice on scanning a book into ebook format
I ran the first open source command-line OCR tool that I could find, in this case https://github.com/tesseract-ocr/tesseract. The command was pretty straightforward: tesseract -l eng book.tif out_from_tiff
Again, a simple shell script should be easy enough to write to apply it to all pages. The output did have a form feed character at the bottom. Obviously you can delete it manually, but that would take forever, so simply run..
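The command the poster used to remove the form feed is cut off above and is not reconstructed here. As a separate sketch, the batch step and the cleanup could be done in Python as follows, assuming one TIFF per page in a directory. The directory layout and function names are hypothetical; the tesseract invocation mirrors the one quoted in the post.

```python
import subprocess
from pathlib import Path

FORM_FEED = "\x0c"  # page-break character found at the end of the output


def build_command(image: Path, output_base: Path, lang: str = "eng") -> list[str]:
    """Same argument order as the quoted command: tesseract -l eng IMAGE OUTPUT_BASE."""
    return ["tesseract", "-l", lang, str(image), str(output_base)]


def strip_form_feed(text: str) -> str:
    """Remove every form feed character from the OCR text."""
    return text.replace(FORM_FEED, "")


def ocr_pages(page_dir: Path, out_dir: Path) -> list[Path]:
    """Run tesseract on each *.tif page and clean the resulting text files."""
    out_dir.mkdir(parents=True, exist_ok=True)
    results = []
    for image in sorted(page_dir.glob("*.tif")):
        output_base = out_dir / image.stem
        subprocess.run(build_command(image, output_base), check=True)
        # tesseract appends ".txt" to the output base it is given
        text_file = Path(str(output_base) + ".txt")
        cleaned = strip_form_feed(text_file.read_text(encoding="utf-8"))
        text_file.write_text(cleaned, encoding="utf-8")
        results.append(text_file)
    return results
```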
-
Is it possible to have the bot check if the repost image is mirrored?
That being said, the tool is using a tesseract-related wrapper, and it might be having trouble picking up the text.
-
Laravel OCR?
Compile Tesseract OCR or get the binary from https://github.com/tesseract-ocr/tesseract and download it locally to your project.
-
Extract Highlighted Text from a Book using Python
I'm going to use the Tesseract OCR engine and library, and its Python wrapper PyTesseract for text extraction. But there are numerous libraries out there to extract text from an image. In a real world application I would probably use cloud services from AWS, Google or Microsoft to handle this task.
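A minimal sketch of the PyTesseract step, assuming the pytesseract and Pillow packages are installed alongside the Tesseract binary. The helper names ocr_image and clean_ocr_text are hypothetical, and detecting the highlighted regions is not shown.

```python
def ocr_image(path: str, lang: str = "eng") -> str:
    """Return the raw text that Tesseract recognizes in the image at `path`."""
    # Imported lazily so the cleanup helper works without these packages.
    from PIL import Image
    import pytesseract

    with Image.open(path) as img:
        return pytesseract.image_to_string(img, lang=lang)


def clean_ocr_text(raw: str) -> str:
    """Collapse runs of whitespace and drop empty lines from OCR output."""
    lines = [" ".join(line.split()) for line in raw.splitlines()]
    return "\n".join(line for line in lines if line)
```

For example, clean_ocr_text(ocr_image("page.png")) would return the recognized text of a hypothetical page.png with blank lines removed.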
-
How to Use Tesseract OCR to Convert PDFs to Text
If this doesn’t fix it then check out this GitHub issue for more troubleshooting steps.
-
How can I do an OCR scan of a PDF that has human handwriting text?
If you want an SDK for this you can use https://github.com/PaddlePaddle/PaddleOCR or https://github.com/tesseract-ocr/tesseract
What are some alternatives?
pytesseract - A Python wrapper for Google Tesseract
OpenCV - Open Source Computer Vision Library
VTK - Mirror of Visualization Toolkit repository
SVG++ - C++ SVG library
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Mayan EDMS - Free Open Source Document Management System (mirror, no pull request or issues)
deep-license-plate-recognition - Automatic License Plate Recognition (ALPR) or Automatic Number Plate Recognition (ANPR) software that works with any camera.
libvips - A fast image processing library with low memory needs.
Face Recognition - The world's simplest facial recognition api for Python and the command line
PyTorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
Flutter - Flutter makes it easy and fast to build beautiful apps for mobile and beyond