Apache PDFBox
Open HTML to PDF
Our great sponsors
Apache PDFBox | Open HTML to PDF | |
---|---|---|
25 | 3 | |
2,149 | 1,734 | |
3.6% | - | |
9.3 | 0.0 | |
1 day ago | 28 days ago | |
Java | Java | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Apache PDFBox
-
So you want to modify the text of a PDF by hand
If you don't mind using java, you can use the open source Apache PDFBox library
It's relatively performant and it's a mature and supported codebase that can accomplish most pdf tasks.
- best pdf library to use in 2023?
-
How to crop, split, remove pages from PDFs with Java and PDFBox
Then, open the pdf_utils/pom.xml file and add a dependency to PDFBox, in the dependencies section:
- Does no one use PDF files anymore?? In need of a PDF generator package...
- Thoughts on Birt Report for pdf reports
-
How I archived 100 million PDF documents... (Part 1)
So, when I started to view the documents, a lot of them simply failed to open. I had to look around for a library that could verify PDF documents. I had some experience with PDFBox in the past, so it seemed to be a good go-to solution. It had no way to verify documents by default, but it could open and parse them and that was enough to filter out the incorrect ones. It felt a little bit strange just to read the whole PDF into the memory to verify if it is correct or not, but hey I needed a simple fix for now and it worked really well.
- Best FOSS (ideally Docker) that can split PDF files ?
-
PDF processing and analysis with open-source tools
PDFBox can do this. It’s not part of the CLI but it wouldn’t be too hard to add:
https://github.com/apache/pdfbox/blob/5b00807463279f1002e245...
-
I am looking to automate a process at work...
You'll find libraries in most languages for parsing content out of PDF files, I did this most recently at work in Java using PDFBox.
-
Understanding PDF conversion
Something along these lines might be a better bet: https://pdfbox.apache.org/ - no idea if that's the best one, but you should be looking for modern, full-featured PDF manipulation libraries. GhostScript is definitely NOT that - it's an ancient PostScript interpreter that barely, barely supports PDFs.
Open HTML to PDF
-
best pdf library to use in 2023?
I would suggest Open HTML to PDF, which is a high-level library to build PDF using HTML strings, so you don't have to learn new complex APIs (assuming you already know HTML and CSS).
-
How do you generate PDF reports from HTML?
If you want to generate accessible PDF/UA compliant PDFs, I find https://github.com/danfickle/openhtmltopdf to be the only solution.
-
Is Jasper still the best Report Engine?
There is also https://github.com/danfickle/openhtmltopdf/ , which is a fork of flying saucer which has more features and is more actively developed :)
What are some alternatives?
iText - [DEPRECATED] Core Java Library + PDF/A, xtra and XML Worker. Only security fixes will be added — please use iText 7
Apache FOP - Apache XML Graphics FOP
OpenPDF - OpenPDF is a free Java library for creating and editing PDF files with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull-requests and bugreports to this GitHub repository. ⛺
flyingsaucer - XML/XHTML and CSS 2.1 renderer in pure Java
Apache POI - Mirror of Apache POI
Dynamic Jasper - Dynamic Reports using Jasper Reports
boxable - Boxable is a library that can be used to easily create tables in pdf documents.
itext7-dotnet - iText for .NET is the .NET version of the iText library, formerly known as iTextSharp, which it replaces. iText represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enha
itext7 - iText for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText can be a boon to nearly every workflow.
Konik - A library to create, read and validate ZUGFeRD compliant invoices. Available for Java and .NET