tesseract-ocr-for-php
pdf-diff
Metric | tesseract-ocr-for-php | pdf-diff
---|---|---
Mentions | 4 | 8
Stars | 2,783 | 786
Growth | - | -
Activity | 4.4 | 1.9
Last commit | 7 months ago | 12 months ago
Language | PHP | Go
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tesseract-ocr-for-php
- PDF processing and analysis with open-source tools
There’s even a library for PHP (https://github.com/thiagoalessio/tesseract-ocr-for-php). I haven’t used it, but I did use Python's pytesseract, and it works fairly well.
- Laravel OCR?
- What are my options for extracting text from photos? I've already got ImageMagick installed, and assume there's a handful of PHP libraries for this task? Which are most performant and most likely to be maintained?
Depends on how consistent and legible the images are. If you've got, say, a scanned page with black-on-white text, it will work fairly well with PHPOCR (http://phpocr.sourceforge.net/) or https://github.com/thiagoalessio/tesseract-ocr-for-php.
- Processing Identity Documents in Laravel
The next step is to use Tesseract in our PHP class. To do that, we'll use this excellent package.
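The mentions above all rely on the Tesseract engine being installed on the machine. As a rough illustration of what an OCR call boils down to, here is a minimal Go sketch that shells out to the `tesseract` command-line program and reads the recognized text from standard output. It assumes `tesseract` is on `PATH` with the `eng` language data installed; `tesseractArgs` and `recognize` are hypothetical helper names, and this is not the API of the PHP package.

```go
package main

import (
	"fmt"
	"os"
	"os/exec"
)

// tesseractArgs builds the arguments for the tesseract CLI.
// Passing "stdout" as the output base makes tesseract print the
// recognized text instead of writing it to a file.
func tesseractArgs(imagePath, lang string) []string {
	return []string{imagePath, "stdout", "-l", lang}
}

// recognize runs tesseract on one image and returns the text it prints.
// Assumption: the tesseract binary is installed and on PATH.
func recognize(imagePath, lang string) (string, error) {
	out, err := exec.Command("tesseract", tesseractArgs(imagePath, lang)...).Output()
	if err != nil {
		return "", fmt.Errorf("tesseract failed: %w", err)
	}
	return string(out), nil
}

func main() {
	if len(os.Args) < 2 {
		fmt.Fprintln(os.Stderr, "usage: ocr <image>")
		return
	}
	text, err := recognize(os.Args[1], "eng")
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		return
	}
	fmt.Print(text)
}
```

As the second mention notes, results depend heavily on how clean and legible the input image is, so a scanned black-on-white page is the easy case for a call like this.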
pdf-diff
- PDF processing and analysis with open-source tools
This tool might be helpful for comparing pdfs: https://github.com/serhack/pdf-diff
- Show HN: PDF-Diff - Visualize any differences between two PDFs
- Casual Friday - Ferie are coming
- pdf-diff - A tool for visualizing differences between two pdf files.
- GitHub pdf-diff: A tool for visualizing differences between two pdf files.
I have opened a new PR that should cut the running time. Thanks for the comment, btw! I'm curious about the NRGBA image representation: why should I use it over RGBA?
- Show HN: Pdf-diff Visualize any differences between two PDFs
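The excerpt above asks about NRGBA versus RGBA without giving the answer. One general difference in Go's standard library, which may or may not be the commenter's reason, is that `color.RGBA` stores alpha-premultiplied channels while `color.NRGBA` stores them non-premultiplied. The sketch below shows the conversion; `premultiply` is a hypothetical helper name, and the code is not taken from pdf-diff.

```go
package main

import (
	"fmt"
	"image/color"
)

// premultiply converts a non-premultiplied colour (the form held by
// image.NRGBA) into the alpha-premultiplied form held by image.RGBA.
func premultiply(r, g, b, a uint8) (uint8, uint8, uint8, uint8) {
	c := color.RGBAModel.Convert(color.NRGBA{R: r, G: g, B: b, A: a}).(color.RGBA)
	return c.R, c.G, c.B, c.A
}

func main() {
	// Opaque red is identical in both representations.
	r, g, b, a := premultiply(255, 0, 0, 255)
	fmt.Println(r, g, b, a) // 255 0 0 255

	// Half-transparent red: the red channel is scaled by alpha.
	r, g, b, a = premultiply(255, 0, 0, 128)
	fmt.Println(r, g, b, a) // 128 0 0 128

	// Fully transparent red: the colour information is lost entirely.
	r, g, b, a = premultiply(255, 0, 0, 0)
	fmt.Println(r, g, b, a) // 0 0 0 0
}
```

Because premultiplication scales each channel by alpha, two pixels that differ only in colour under low alpha can become hard to tell apart after conversion, which matters when comparing images pixel by pixel. Go's PNG decoder can also return `*image.NRGBA` for images with an alpha channel, so working in that representation can avoid a conversion step.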
What are some alternatives?
react-native-tesseract-ocr - Tesseract OCR wrapper for React Native
diffpdf
Laravel - Laravel is a web application framework with expressive, elegant syntax. We’ve already laid the foundation for your next big idea — freeing you to create without sweating the small things.
pdfsizeopt - PDF file size optimizer
Symfony - The Symfony PHP framework
diff-pdf - A simple tool for visually comparing two PDF files
identitydocuments - A Laravel package for parsing and processing Identity Documents
Apache PDFBox - Mirror of Apache PDFBox
tessdata - Trained models with fast variant of the "best" LSTM models + legacy models
gotenberg - A developer-friendly API for converting numerous document formats into PDF files, and more!
tesseract-ocr - Tesseract Open Source OCR Engine (main repository)
pdfsam - PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages