| | pdf-diff | tesseract-ocr-for-php |
|---|---|---|
| Mentions | 8 | 4 |
| Stars | 786 | 2,792 |
| Growth | - | - |
| Activity | 1.9 | 4.4 |
| Last commit | about 1 year ago | 7 months ago |
| Language | Go | PHP |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pdf-diff
- PDF processing and analysis with open-source tools
  This tool might be helpful for comparing PDFs: https://github.com/serhack/pdf-diff
- Show HN: PDF-Diff - Visualize any differences between two PDFs
- Casual Friday - Ferie are coming
- pdf-diff - A tool for visualizing differences between two pdf files.
- GitHub pdf-diff: A tool for visualizing differences between two pdf files.
  I have opened a new PR that should cut the running time. Thanks for the comment, btw! I'm curious about the (N)RGBA image representation: why should I use it over RGBA?
- Show HN: Pdf-diff Visualize any differences between two PDFs
tesseract-ocr-for-php
- PDF processing and analysis with open-source tools
  There’s even a library for PHP (https://github.com/thiagoalessio/tesseract-ocr-for-php). I haven’t used it. I did use Python's Pytesseract, and it works fairly well.
- Laravel OCR?
- What are my options for extracting text from photos? I've already got ImageMagick installed, and assume there's a handful of PHP libraries for this task? Which are most performant and most likely to be maintained?
  Depends on how consistent and legible the images are. If you've got, say, a scanned page with black-on-white text, it will work fairly well with PHPOCR (http://phpocr.sourceforge.net/) or https://github.com/thiagoalessio/tesseract-ocr-for-php.
- Processing Identity Documents in Laravel
  The next step is to use Tesseract in our PHP class. To do that, we'll use this excellent package
What are some alternatives?
diffpdf
react-native-tesseract-ocr - Tesseract OCR wrapper for React Native
pdfsizeopt - PDF file size optimizer
Laravel - Laravel is a web application framework with expressive, elegant syntax. We’ve already laid the foundation for your next big idea — freeing you to create without sweating the small things.
diff-pdf - A simple tool for visually comparing two PDF files
Symfony - The Symfony PHP framework
Apache PDFBox - Mirror of Apache PDFBox
identitydocuments - A Laravel package for parsing and processing Identity Documents
gotenberg - A developer-friendly API for converting numerous document formats into PDF files, and more!
tessdata - Trained models with fast variant of the "best" LSTM models + legacy models
pdfsam - PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages
tesseract-ocr - Tesseract Open Source OCR Engine (main repository)