Our great sponsors
-
i7j-rups
RUPS is an acronym for Reading and Updating PDF Syntax. RUPS is a tool built on top of iText® that allows you to look inside a PDF document and browse the different PDF objects and content streams.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
PDF-to-JPG-to-PDF
Convert PDF to JPG images, convert JPG images to PDF, extract select pages from a PDF into a new PDF
If you don't mind using java, you can use the open source Apache PDFBox library
https://pdfbox.apache.org/
It's relatively performant and it's a mature and supported codebase that can accomplish most pdf tasks.
Great post. I've spend a lot of time reading through the PDF specification over the last ~5 years while building DocSpring [1], and I still feel like I've barely scratched the surface. qpdf is a great tool. One of my other favorites is RUPS [2], which really lets you dig into the structure of a PDF.
[1] https://docspring.com
[2] https://github.com/itext/i7j-rups
Semi-related(?) - I created a repository to convert PDF to JPG and back to PDF:
https://github.com/whyboris/PDF-to-JPG-to-PDF
A government form didn't have editable fields that needed to be filled out. And editing the PDF was impossible (password protection). This was my solution.