naps2
OCRmyPDF
naps2 | OCRmyPDF | |
---|---|---|
85 | 1 | |
2,445 | 18 | |
- | - | |
9.8 | 3.6 | |
16 days ago | almost 2 years ago | |
C# | Python | |
GNU General Public License v3.0 or later | Mozilla Public License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
naps2
-
HP misreads room, awkwardly brags about its “less hated” printers | Opinion: HP's printer business practices have infuriated users for years.
If you have a HP Printer there is scanning software program (Free and Open source) called Not Another PDF Scanner. https://www.naps2.com/ It's simple to use and works really good with HP printers unlike the software that comes with the printers. Everyone who wants to scan with a HP printers needs this software.
-
Paperless-Ngx v2.0.0
Brother DCP-L2550DW here. One of the cheapest b/w multifunction devices with automatic document feeder and reasonable print and scan performance. Works like a charm on Linux, Windows, Android, and IOS.
I am using it with [NAPS2](https://www.naps2.com/), which is brilliantly simple, multi-platform, free, and open-source.
- SumatraPDF Reader
-
Pdftool.org: modify pdfs offline in the browser
If you want to add page, remove page, split, merge, reorder, re-orient all or individual pages, ... May I recommend the glorious NAPS2 [1] ?
It's meant as a scanning tool but works just fine without scanning just drag and drop a pdf on it.
It doesn't do in-page editing or annotation, it's "one layer above" that.
[1] https://www.naps2.com/
-
A note of appreciation for paperless ngx
Tips: For PDF management/splitting/rotation/cropping ect before import: NAPS2 is a good tool (Windows, Linux, Mac). It also support ocr (same as paperless-ngx use) https://www.naps2.com/
- Software welche PDF durchsuchbar macht?
-
Good scanning software?
To scan to image files, use the included Windows Fax & Scan app. To scan to PDF files, use NAPS2.
-
Best way to digitalize reciepts?
The software I use is: https://www.naps2.com/
-
Any tool that can turn a scanned paper into an editable PDF
If you are on windows this can scan and OCR using tesseract. https://www.naps2.com/
-
I need a software for creating Arabic OCR documents from PDF
Try https://www.naps2.com/ It has Arabic support. Once installed with OCR settings, if you drap pdf onto app, it does OCR.
OCRmyPDF
-
OCRmyPDF: Add an OCR text layer to scanned PDF file
As mentioned in the other replies, Google's OCR is limited. OCRmyPDF is designed for PDFs. So if you download a 1000+ page public-domain dictionary off of Archive.org (which is something I do regularly), and you want to re-run the OCR because Internet Archive doesn't tune its OCR very well for multilingual works (if it all), then OCRmyPDF is going to beat Google's automatic OCR every time.
However, I recently paid a programmer to fork OCRmyPDF to give it the option to use Google's OCR engine instead of Tesseract. That fork is here: https://github.com/ualiawan/OCRmyPDF. It's more fiddly than the regular OCRmyPDF, and it requires a Google Cloud Vision account (which charges some fraction of a cent for each page OCRed), but it works well, and in some cases may produce better results than OCRmyPDF, although you must be sure to specify the language of the document.
What are some alternatives?
scantailor-advanced - ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.
pdfarranger - Small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.
doctr - docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
GenZ-Save-File-Editor - A simple save file editor for the game "Generation Zero"
OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
PDF-real-time-Examples - This repo contains examples of the most popular PDF templates generated using Syncfusion's .NET PDF library. You can use these C# examples in your project to generate PDF documents automatically.
scantailor-universal - ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST
Stirling-PDF - #1 Locally hosted web application that allows you to perform various operations on PDF files
gImageReader - A Gtk/Qt front-end to tesseract-ocr.