pdfgrep
recoll-webui
pdfgrep | recoll-webui | |
---|---|---|
5 | 1 | |
43 | 3 | |
- | - | |
0.0 | 10.0 | |
over 1 year ago | over 2 years ago | |
Emacs Lisp | Python | |
GNU General Public License v3.0 only | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pdfgrep
-
Recoll ā Full-text search for your desktop
I use this script to make recoll produce pdfgrep-like output so that I can use it with Emacs and pdfgrep.el. This gives a nice interactive way to search through thousands of pdf files.
https://github.com/jeremy-compostella/pdfgrep/pull/8#issueco...
-
Pdfgrep ā a commandline utility to search text in PDF files
For Emacs users there is also https://github.com/jeremy-compostella/pdfgrep which lets you browse the results and open the original docs highlighting the selected match.
-
Search multiple selected pdfs in Org mode Emacs at the same time?
Pdfgrep is another option. It's a command line utility. I think you can just give it the file name of a certain number of PDFs and it'll search through them. There's apparently a pdfgrep mode and Helm apparently has pdfgrep as well. I'm not sure if any will search all open PDF buffers rather than a directory though.
- pdfgrep: Emacs module providing grep comparable facilities but for PDF files
-
Is it possible to search text into OCRed PDFs? How?
The eMacs interface can be found here: https://github.com/jeremy-compostella/pdfgrep (sorry, Iām too lazy to see if someone has created a package for this).
recoll-webui
-
Recoll ā Full-text search for your desktop
Recoll is not just useable for desktop applications, it can also be used as a local web search engine through recoll-webui [1] (link goes to my own repo which has some modifications to make it work with the Searx/SearxNG engine) which in turn can be used as an "engine" in Searx and SearxNG through the recoll engine [2] (which has been merged so it is no longer necessary to pull it from my repo).
This last option makes Searx/SearxNG useable for all types of searches, both local as well as remote. I've been using this exclusively for many years now over a large collection of documents (about 600.000 entries) with good results.
[1] https://github.com/Yetangitu/recoll-webui
[2] https://docs.searxng.org/admin/engines/recoll.html
[2] https://searx.github.io/searx/admin/engines/recoll.html
What are some alternatives?
ripgrep-all - rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
rg.el - Emacs search tool based on ripgrep
fsearch - A fast file search utility for Unix-like systems based on GTK3
docquery - An easy way to extract information from documents
pdf-keywords-extractor
dumb-jump - an Emacs "jump to definition" package for 50+ languages
ede-php-autoload - PHP autoloading simulation for Emacs' Semantic
looqs - FTS desktop file search with previews