SaaSHub helps you find the best software and product alternatives Learn more โ
Top 15 Python Tesseract Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
-
J.A.R.V.I.S
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
BetterOCR
๐ Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with ๐ง LLM.
-
Automatic-License-Plate-Recognition
Automatic License Plate Recognition is implemented using Python, OpenCV and Tesseract to recognize Indian license plates and store the data in a CSV file.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it absolutely brilliant.
Project mention: ๐ฅ 600+ ๐ and 140+ Forks to J.A.R.V.I.S ๐, Added Dynamic Face Recognition to J.A.R.V.I.S ๐ค | dev.to | 2023-05-14[GitHub Code](https://github.com/GauravSingh9356/J.A.R.V.I.S
you can train tesseract models as described here, but you would need to create a dataset first: https://github.com/tesseract-ocr/tesstrain
Week 5: ๐Optical Character Recognition (OCR) & ๐Keyword Search
Python Tesseract related posts
- Marker: Convert PDF to Markdown quickly with high accuracy
- A better document viewer
- OCR in-game text using Tesseract
- OCR for a full pdf on Neoreader
- ELI5: why is PDF such a widespread text format, instead of a format that's actually easier to edit?
- [Free-Post Friday!] Recommendations for high volume document scanners
- OCR pdf and just keep the OCR text
-
A note from our sponsor - SaaSHub
www.saashub.com | 26 Apr 2024
Index
What are some of the best open-source Tesseract projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | OCRmyPDF | 11,936 |
2 | RPA-Python | 4,525 |
3 | PyMuPDF | 4,002 |
4 | tesserocr | 1,928 |
5 | textshot | 1,677 |
6 | lambda-packs | 1,105 |
7 | J.A.R.V.I.S | 781 |
8 | tesstrain | 568 |
9 | BetterOCR | 383 |
10 | Nkocr | 33 |
11 | Automatic-License-Plate-Recognition | 13 |
12 | hypercube-viewer | 11 |
13 | pytesseract-ocr-plugin | 8 |
14 | schlaumeier | 7 |
15 | koann | 2 |
Sponsored