pdf-keywords-extractor
pdfgrep
pdf-keywords-extractor | pdfgrep | |
---|---|---|
5 | 5 | |
25 | 43 | |
- | - | |
0.0 | 0.0 | |
over 1 year ago | over 1 year ago | |
RobotFramework | Emacs Lisp | |
MIT License | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pdf-keywords-extractor
-
PDF word analyiser
This automation will surface for each page whether the word is present or not in a CSV. You can then load that CSV and count in excel
-
Pdfgrep ā a commandline utility to search text in PDF files
Tangential:
Some time ago I built an automation [1] that automatically identifies whether the given PDFs contain the specified keywords, outputting the result as a CSV file.
Similar to PDFGrep, probably much slower, but potentially more convenient for people preferring GUIs
[1] https://github.com/bendersej/pdf-keywords-extractor
- I made an open-source automation to extract keywords from any PDFs
- Show HN: I built an open source automation to extract keywords from PDFs
pdfgrep
-
Recoll ā Full-text search for your desktop
I use this script to make recoll produce pdfgrep-like output so that I can use it with Emacs and pdfgrep.el. This gives a nice interactive way to search through thousands of pdf files.
https://github.com/jeremy-compostella/pdfgrep/pull/8#issueco...
-
Pdfgrep ā a commandline utility to search text in PDF files
For Emacs users there is also https://github.com/jeremy-compostella/pdfgrep which lets you browse the results and open the original docs highlighting the selected match.
-
Search multiple selected pdfs in Org mode Emacs at the same time?
Pdfgrep is another option. It's a command line utility. I think you can just give it the file name of a certain number of PDFs and it'll search through them. There's apparently a pdfgrep mode and Helm apparently has pdfgrep as well. I'm not sure if any will search all open PDF buffers rather than a directory though.
- pdfgrep: Emacs module providing grep comparable facilities but for PDF files
-
Is it possible to search text into OCRed PDFs? How?
The eMacs interface can be found here: https://github.com/jeremy-compostella/pdfgrep (sorry, Iām too lazy to see if someone has created a package for this).
What are some alternatives?
docquery - An easy way to extract information from documents
ripgrep-all - rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
rpaframework - Collection of open-source libraries and tools for Robotic Process Automation (RPA), designed to be used with both Robot Framework and Python
rg.el - Emacs search tool based on ripgrep
looqs - FTS desktop file search with previews
recoll-webui - web interface for recoll desktop search
pdfgrep
dumb-jump - an Emacs "jump to definition" package for 50+ languages
ede-php-autoload - PHP autoloading simulation for Emacs' Semantic