Stirling-PDF
pdfcpu
Stirling-PDF | pdfcpu | |
---|---|---|
22 | 30 | |
25,728 | 6,342 | |
24.3% | 3.2% | |
9.8 | 9.1 | |
about 12 hours ago | 15 days ago | |
Java | Go | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stirling-PDF
-
Stirling PDF: Self-hosted, web-based PDF manipulation tool
Well it was developed initially by ChatGPT. First file I open I see repeated comments.
https://github.com/Stirling-Tools/Stirling-PDF/blob/7f577a60...
-
A small lathe built in a Japanese prison camp
My use-case was easier than yours (microfiche of type) but I found this https://github.com/Stirling-Tools/Stirling-PDF incredibly handy.
I still had to write a bit of Python, but this really is a PDF swiss army knife.
- FLaNK Weekly 31 December 2023
- Stirling-PDF: local web application to perform various operations on PDFs
- SumatraPDF Reader
- App to fill in for PDFs
- A note of appreciation for paperless ngx
- Where does it end? Subscription License Increase.
- I'm an absolute noob/beginner - What are the basics steps to install or self-host Vikunja App?
- Self hosted alternative to smallpdf.com
pdfcpu
- Show HN: A PDF Processing CLI/API Written in Go
- Show HN
-
Making a PDF that's larger than Germany
Slightly tangential: if you are hacking on PDFs, manually or otherwise, this is an incredibly useful tool: https://pdfcpu.io/ (not the author, just a user)
-
Stirling-PDF: local web application to perform various operations on PDFs
A really nice, stand-alone command line tool is pdfcpu.
https://github.com/pdfcpu/pdfcpu
-
pdfcpu v0.6.0 out! - pdfcpu.io
Check it out => https://github.com/pdfcpu/pdfcpu/releases/tag/v0.6.0
-
Marker: Convert PDF to Markdown quickly with high accuracy
I can report that the closest I've came before is with PDFMiner (https://pypi.org/project/pdfminer/) for Python. The benefit of this one is that it retains styling information, so that italics and the like can be retained, at least with some post-processing (I think one might need to convert certain CSS-classes to actual or tags).
The other option I have started looking into is the PDFCPU library for Go. It is a bit more low-level than PDFMiner, but one gets out very well structured info, that seem it might be possible to post-process quite well, for one's particular use case and PDF layouts: https://github.com/pdfcpu/pdfcpu
I also now tried the Marker tool in the OT, and it seems to do a reasonable job. It did intermingle some columns though, at least in some tricky cases such as when there were a round shaped image in between the two columns. One note is that Marker doesn't seem to retain styling like italics though.
-
PDFcpu snippet for read text of PDF file?
Of course, the best way would be to solve it via the API without CLI. But this doesn't seem to work. https://github.com/pdfcpu/pdfcpu/issues/122
- wie splittet ihr denn PDFs - ich hab hier einige - die ich zerlegen muss in Teile
- Do you know any library to make pdf in golang?
- Pdfcpu: A Go PDF Processor
What are some alternatives?
tpotce - 🍯 T-Pot - The All In One Honeypot Platform 🐝
gopdf - A simple library for generating PDF written in Go lang
pdfarranger - Small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.
go-wkhtmltopdf - Go bindings for wkhtmltopdf and high-level HTML to PDF conversion interface
OpenVoice - Instant voice cloning by MyShell.
qpdf - QPDF: A content-preserving PDF document transformer
naps2 - Scan documents to PDF and more, as simply as possible.
merge2pdf - Merge Image and PDF files (optionally with selective pages) with lossless quality
introduction-to-github - Get started using GitHub in less than an hour.
markpdf - Watermark PDF files using image or text
lxd-dashboard - This LXD dashboard is a web-based user interface (GUI) for managing containers and virtual machines through LXD
ngrok - Unified ingress for developers