tessen
OCRmyPDF
Our great sponsors
tessen | OCRmyPDF | |
---|---|---|
14 | 77 | |
65 | 11,936 | |
- | 4.3% | |
6.2 | 9.6 | |
about 2 months ago | 10 days ago | |
Shell | Python | |
GNU General Public License v3.0 only | Mozilla Public License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tessen
-
KeePassXC 2.7.0 Released
Looks like a known issue.
https://github.com/keepassxreboot/keepassxc/issues/2281
I'm considering adding support for keepassxc in tessen but autotype works only on wlroots based compositors like sway right now.
https://github.com/ayushnix/tessen/issues/19
-
Some tiny personal programs I've written
This may not be as impressive but I wrote a script to eliminate one of the primary roadblocks I faced when I moved to Wayland on my Linux desktop — a script to copy and autotype password store amd gopass data, kinda like rofi-pass
https://github.com/ayushnix/tessen
-
tessen v2 released: support for gopass added
tessen is a bash script to autotype and copy password store data on wayland compositors. The latest release of tessen adds support for gopass as well, although parsing YAML files isn't supported. If gopass files use the same format mentioned here, tessen should work fine.
-
Wayland native desktop launcher with password-store support?
I'd be willing to package it for Void Linux but I don't really have any experience with how Void Linux packages work. tessen is just a shell script so you can download it and place it in your $PATH and use it if you want.
-
Fuzzel 1.7 was released with lots of improvements and fixes
As I've mentioned in the isse tracker here, the only thing that's missing from fuzzel is support for using a configuration file. If it gets that, it would replace every other launcher and dmenu program on Wayland for me and would also allow me to go ahead and brand fuzzel as the default dmenu backend for tessen.
- tessen: an interactive menu to autotype and copy password store data on Wayland, like rofi-pass
-
tessen v1.2.1 released: autotype and copy password store data on Wayland, like rofi-pass
I made this post a few weeks ago about tessen's initial release. Since then, I've added a few features that might've prevented rofi-pass users from switching to Wayland based compositors (I was one of them).
-
(Help) How to use wtype?
I ran into this problem as well when making tessen. fzf doesn't have a GUI like rofi, bemenu, and wofi do, so you can't use fzf to type in data in anything else besides the terminal in which it was opened, at least not without resorting to ugly hacks (which is what the swaymsg window move method is).
-
tessen: autotype and copy password-store data on Wayland, like rofi-pass
Support for wofi has been added as well
OCRmyPDF
-
TextSnatcher: Copy text from images, for the Linux Desktop
Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it absolutely brilliant.
- FLaNK Stack Weekly 19 Feb 2024
-
Calibre – New in Calibre 7.0
I recommend running any such PDFs through OCRmyPDF.
https://github.com/ocrmypdf/OCRmyPDF
-
A better document viewer
If by "like a photocopy" you mean the file contains images of text rather than text, the MacOS viewer presumably does OCR on the images. I don't know if there's a Linux document viewer with that capability built-in, but a quick search turned up the standalone tool OCRmyPDF.
- Gibts ein (CLI) tool, das Kontrast und Helligkeit von gescannten Textdokumenten dynamisch anpasst?
-
OCR for a full pdf on Neoreader
For anyone interested I solved the problem by first ocr files through the free and open source software ocrmypdf avaible here
-
ELI5: why is PDF such a widespread text format, instead of a format that's actually easier to edit?
ocrmypdf is nice for stuff like that.
- Donut: OCR-Free Document Understanding Transformer
-
massive crop and OCR newspaper
Use imagemagick to convert them to PDF and ocrmypdf to straighten and OCR. See this explanation.
-
OCR pdf and just keep the OCR text
Fair enough, maybe this might work for you, it should seperate the text from image anyway and if you have Adobe acrobat it should be able delete the background too with the edit function. It may already be able to do that if you haven't tried it
What are some alternatives?
bemenu - Dynamic menu library and client program inspired by dmenu
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
rofi - Rofi: A window switcher, run dialog and dmenu replacement - fork with wayland support
pdfplumber - Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
rofi-pass - rofi frontend for pass
tesserocr - A Python wrapper for the tesseract-ocr API
pass-grave - An extension for pass (the standard Unix password manager) to easily hide the metadata of the password store
Paperless-ng - A supercharged version of paperless: scan, index and archive all your physical documents
rofi-emoji - Emoji selector plugin for Rofi
invoice2data - Extract structured data from PDF invoices
pass-tessen - fuzzy data selection and copy-paste from password store
pdfminer.six - Community maintained fork of pdfminer - we fathom PDF