pdf-keywords-extractor
pdfgrep
| pdf-keywords-extractor | pdfgrep |
---|---|---|
Mentions | 5 | 4 |
Stars | 25 | - |
Growth | - | - |
Activity | 0.0 | - |
Last commit | over 1 year ago | - |
Language | RobotFramework | - |
License | MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pdf-keywords-extractor
-
PDF word analyser
This automation reports, for each page, whether the word is present or not, and writes the result to a CSV. You can then load that CSV and count the matches in Excel.
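If you would rather not do the counting in Excel, the same tally can be sketched in a few lines of Python. The CSV layout here (columns `page`, `keyword`, `present`) is an assumption made for illustration; the actual file produced by pdf-keywords-extractor may use different column names, so adjust accordingly.

```python
import csv
import io
from collections import Counter

# Hypothetical sample: one row per (page, keyword) pair with a true/false flag.
# The real output of pdf-keywords-extractor may be laid out differently.
SAMPLE_CSV = """page,keyword,present
1,invoice,true
1,refund,false
2,invoice,true
2,refund,true
3,invoice,false
3,refund,false
"""


def count_pages_per_keyword(csv_file):
    """Return a dict mapping each keyword to the number of pages containing it."""
    counts = Counter()
    for row in csv.DictReader(csv_file):
        keyword = row["keyword"]
        counts[keyword] += 0  # ensure keywords with no hits still appear
        if row["present"].strip().lower() == "true":
            counts[keyword] += 1
    return dict(counts)


if __name__ == "__main__":
    print(count_pages_per_keyword(io.StringIO(SAMPLE_CSV)))
```

To use it on a real file, pass an open file handle instead of the in-memory sample, for example `open("results.csv", newline="")`.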
-
Pdfgrep – a commandline utility to search text in PDF files
Tangential:
Some time ago I built an automation [1] that automatically identifies whether the given PDFs contain the specified keywords, outputting the result as a CSV file.
Similar to PDFGrep, probably much slower, but potentially more convenient for people who prefer GUIs.
[1] https://github.com/bendersej/pdf-keywords-extractor
- I made an open-source automation to extract keywords from any PDFs
- Show HN: I built an open source automation to extract keywords from PDFs
pdfgrep
-
Would you prefer a hard copy of a reference manual over a PDF?
Have you seen https://gitlab.com/pdfgrep/pdfgrep ?
-
Pdfgrep – a commandline utility to search text in PDF files
Looking at the list of dependencies, it seems like they use poppler-cpp to render the PDFs.
https://gitlab.com/pdfgrep/pdfgrep#dependencies
-
Can Okular display current match and total matches in PDF find
I don’t know for sure but I believe not. If you’re not searching too often and/or would like to be able to search using regex as well, pdfgrep might be a nice option for you.
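As a rough sketch of what that could look like, the snippet below wraps pdfgrep from Python and asks it for match counts rather than matching lines, using its `--count` (total per file) and `--page-count` (matches per page) options. The helper names and the file name `manual.pdf` are hypothetical, and it assumes pdfgrep is installed and on your `PATH`.

```python
import subprocess


def build_pdfgrep_command(pattern, pdf_path, per_page=False, ignore_case=False):
    """Build an argv list that makes pdfgrep print match counts for a regex."""
    command = ["pdfgrep"]
    if ignore_case:
        command.append("--ignore-case")
    # --count prints the total for the file; --page-count prints it per page.
    command.append("--page-count" if per_page else "--count")
    command += [pattern, pdf_path]
    return command


def count_matches(pattern, pdf_path, per_page=False):
    """Run pdfgrep and return its raw count output as text (requires pdfgrep)."""
    result = subprocess.run(
        build_pdfgrep_command(pattern, pdf_path, per_page=per_page),
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()


if __name__ == "__main__":
    # Only shows the command; call count_matches() to actually run pdfgrep.
    print(build_pdfgrep_command("colou?r", "manual.pdf", per_page=True))
```

This gives a total, or a per-page breakdown, for a regex search, though not the interactive "match 3 of 17" position that a viewer would show.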
What are some alternatives?
docquery - An easy way to extract information from documents
pdfgrep - PDFGrep is a GNU/Emacs module providing grep comparable facilities but for PDF files
ripgrep-all - rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
rpaframework - Collection of open-source libraries and tools for Robotic Process Automation (RPA), designed to be used with both Robot Framework and Python
looqs - FTS desktop file search with previews