tesseract-ocr-for-php
pdfsam
Our great sponsors
tesseract-ocr-for-php | pdfsam | |
---|---|---|
4 | 63 | |
2,783 | 3,085 | |
- | - | |
4.4 | 8.5 | |
7 months ago | 5 days ago | |
PHP | Java | |
MIT License | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tesseract-ocr-for-php
-
PDF processing and analysis with open-source tools
There’s even a library for php (https://github.com/thiagoalessio/tesseract-ocr-for-php). Haven’t used it. I did used python Pytesseract & works fairly well.
- Laravel OCR?
-
What are my options for extracting text from photos? I've already got ImageMagick installed, and assume there's a handful of PHP libraries for this task? Which are most performant and most likely to be maintained?
Depends on how consistent and legible the images are. If you've got, say, a scanned page with black-on-white text, it will work fairly well with PHPOCR (http://phpocr.sourceforge.net/) or https://github.com/thiagoalessio/tesseract-ocr-for-php.
-
Processing Identity Documents in Laravel
The next step is to use Tesseract in our PHP class, to do that we'll use this excellent package
pdfsam
-
pdfsam VS cpdf-binaries - a user suggested alternative
2 projects | 18 Aug 2023
-
Pdftool.org: modify pdfs offline in the browser
I find pdfsam to be the perfect offline tool for me on Windows.
https://pdfsam.org
And it's open source as well: https://github.com/torakiki/pdfsam
-
PDFSAM is UNSAFE ? VIRUSTOTAL Analysis
This thread may be relevant.
-
Free pdf editor?
That might be fit for your purpose PDFSam
- [Open Source] Quel est l'outil PDF FOSS le plus complet (fendre, fusionner, convertir, etc.)?
-
[Frugal] Recherche d'éditeurs de PDF gratuits
*https://pdfsam.org/
-
My 2023 Bingo Card
use pdfsam.org to split the pdf into individual sheets fyi
-
What's the best tool to merge pdf files?
PDFsam - merge, split, extract pages, rotate and mix your PDF files
-
I made a free PDF editor that works in your browser
PDF Sam is open source and can handle that https://pdfsam.org/
-
Windows 11 Notepad PDF Viewing
I have used tools, like PDFsam, to insert, delete, reorder, and rotate pages in existing PDFs. It's a great tool, but it's not a complete PDF editor.
What are some alternatives?
react-native-tesseract-ocr - Tesseract OCR wrapper for React Native
pdfarranger - Small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.
Laravel - Laravel is a web application framework with expressive, elegant syntax. We’ve already laid the foundation for your next big idea — freeing you to create without sweating the small things.
pdftk
Symfony - The Symfony PHP framework
sumatrapdf - SumatraPDF reader
identitydocuments - A Laravel package for parsing and processing Identity Documents
pdfcpu - A PDF processor written in Go.
tessdata - Trained models with fast variant of the "best" LSTM models + legacy models
pdf2docx - Open source Python library for converting PDF to DOCX.
tesseract-ocr - Tesseract Open Source OCR Engine (main repository)
boxable - Boxable is a library that can be used to easily create tables in pdf documents.