Apache PDFBox
OpenPDF
Metric | Apache PDFBox | OpenPDF |
---|---|---|
Mentions | 25 | 11 |
Stars | 2,149 | 2,888 |
Growth | 3.6% | 2.5% |
Activity | 9.3 | 0.0 |
Latest commit | 1 day ago | 8 days ago |
Language | Java | Java |
License | Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Apache PDFBox
- So you want to modify the text of a PDF by hand
  If you don't mind using Java, you can use the open source Apache PDFBox library. It's relatively performant, and it's a mature, supported codebase that can accomplish most PDF tasks.
- best pdf library to use in 2023?
- How to crop, split, remove pages from PDFs with Java and PDFBox
  Then open the pdf_utils/pom.xml file and add a dependency on PDFBox in the dependencies section.
- Does no one use PDF files anymore?? In need of a PDF generator package...
- Thoughts on Birt Report for pdf reports
- How I archived 100 million PDF documents... (Part 1)
  So, when I started to view the documents, a lot of them simply failed to open. I had to look around for a library that could verify PDF documents. I had some experience with PDFBox in the past, so it seemed to be a good go-to solution. It had no way to verify documents by default, but it could open and parse them, and that was enough to filter out the incorrect ones. It felt a little strange to read the whole PDF into memory just to verify whether it is correct, but hey, I needed a simple fix for now and it worked really well.
- Best FOSS (ideally Docker) that can split PDF files ?
- PDF processing and analysis with open-source tools
  PDFBox can do this. It’s not part of the CLI but it wouldn’t be too hard to add:
  https://github.com/apache/pdfbox/blob/5b00807463279f1002e245...
- I am looking to automate a process at work...
  You'll find libraries in most languages for parsing content out of PDF files; I did this most recently at work in Java using PDFBox.
- Understanding PDF conversion
  Something along these lines might be a better bet: https://pdfbox.apache.org/ - no idea if that's the best one, but you should be looking for modern, full-featured PDF manipulation libraries. Ghostscript is definitely NOT that - it's an ancient PostScript interpreter that barely, barely supports PDFs.
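
The archiving post quoted above filtered out broken files by opening and parsing each one with PDFBox. As a rough illustration of a cheaper first pass, the sketch below uses only the Java standard library to check for the `%PDF-` header near the start of a file and the `%%EOF` marker near its end. This is not the author's method and not a PDFBox API: the class `PdfQuickCheck`, the method `looksLikePdf`, and the 1024-byte window are hypothetical choices made for this example. A file that passes can still be corrupt, so a full parse would remain the real test.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper for illustration; not part of PDFBox or OpenPDF.
class PdfQuickCheck {

    // Assumed window size: how many bytes to inspect at each end of the file.
    private static final int WINDOW = 1024;

    // Returns true if the file has a "%PDF-" header near the start and an
    // "%%EOF" marker near the end. This is a heuristic, not a validation.
    static boolean looksLikePdf(Path file) throws IOException {
        try (RandomAccessFile raf = new RandomAccessFile(file.toFile(), "r")) {
            long length = raf.length();
            int window = (int) Math.min(WINDOW, length);

            // Read the first bytes of the file.
            byte[] head = new byte[window];
            raf.readFully(head);

            // Read the last bytes of the file without loading the rest.
            byte[] tail = new byte[window];
            raf.seek(length - window);
            raf.readFully(tail);

            // ISO-8859-1 maps every byte to one char, so binary data is safe.
            String headText = new String(head, StandardCharsets.ISO_8859_1);
            String tailText = new String(tail, StandardCharsets.ISO_8859_1);
            return headText.contains("%PDF-") && tailText.contains("%%EOF");
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java PdfQuickCheck <file>");
            return;
        }
        Path file = Paths.get(args[0]);
        System.out.println(file + " looks like a PDF: " + looksLikePdf(file));
    }
}
```

Because only two small windows are read, the check avoids pulling the whole document into memory, which is the cost the post's author found strange. Files that fail it can be discarded early, and the survivors can then be handed to a real parser.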
OpenPDF
- How do I convert an HTML file to a PDF file using Java(FX)?
  Not really a JavaFX question, as JavaFX is a UI framework. Your question is therefore just related to Java. To answer it: I've seen a similar feature in iText (https://itextpdf.com) and in OpenPDF (https://github.com/LibrePDF/OpenPDF), an open source alternative. Maybe take a look at those for a start and check whether they provide what you're looking for.
- Complete guide to open source licenses for developers
  In addition, the License file may not contain the text of the license directly, but rather information about the licenses under which the project is published or which projects/libraries are included in it. For example, OpenPDF.
- Ask HN: Why is the PDF format so inaccessible?
- Show HN: PDF API – Generate, convert, and modify PDF documents
- QuestPDF 2022.3 - a new release of the modern, open-source library for PDF generation 🎉 Please help me make it popular 🚀
  http://www.pdfsharp.net/ may be for you. I cannot think of another one, except maybe the Java, MPL-licensed https://github.com/LibrePDF/OpenPDF
- Libraries, Frameworks and Technologies you would NOT recommend
  OpenPDF is a pretty decent replacement for iText. They forked iText from the version right before iText changed the license and have been actively developing it since.
What are some alternatives?
iText - [DEPRECATED] Core Java Library + PDF/A, xtra and XML Worker. Only security fixes will be added — please use iText 7
Apache FOP - Apache XML Graphics FOP
flyingsaucer - XML/XHTML and CSS 2.1 renderer in pure Java
Open HTML to PDF - An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!
Apache POI - Mirror of Apache POI
Dynamic Jasper - Dynamic Reports using Jasper Reports
boxable - Boxable is a library that can be used to easily create tables in pdf documents.
itext7 - iText for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText can be a boon to nearly every workflow.