Our great sponsors
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
If you want a suggestion, use tesserocr instead of Pytesseract. It’s an actual binding to the tesseract library (Python talks to it directly, instead of calling a program as a subprocess), which means it runs more efficiently, you can process multiple images sequentially with the same OCR engine (pytesseract has to start a process and a new engine for every image that gets processed), you get access to more functionality options, and a bunch of other beneficial stuff. If you’re doing preprocessing with OpenCV it’s even possible to pass those arrays directly to tesseract in memory, whereas Pytesseract requires that you save each image to a file before it can process it.
Related posts
- Tesserocr
- [Question] I am trying to segment the image using python.
- Python app that will take a picture, scan it and upload that information into a excel file.
- [Question] Working on a simple OCR program but the text from the image is returned in a backward order and it has trouble reading multiple words on a line
- Can´t get part of this REGEX-pattern to work?