Apache POI
Apache PDFBox
Our great sponsors
Apache POI | Apache PDFBox | |
---|---|---|
0 | 23 | |
1,634 | 2,019 | |
1.2% | 2.7% | |
9.7 | 9.7 | |
3 days ago | 3 days ago | |
Java | Java | |
- | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Apache POI
We haven't tracked posts mentioning Apache POI yet.
Tracking mentions began in Dec 2020.
Apache PDFBox
-
How to crop, split, remove pages from PDFs with Java and PDFBox
Then, open the pdf_utils/pom.xml file and add a dependency to PDFBox, in the dependencies section:
- Does no one use PDF files anymore?? In need of a PDF generator package...
- Thoughts on Birt Report for pdf reports
-
How I archived 100 million PDF documents... (Part 1)
So, when I started to view the documents, a lot of them simply failed to open. I had to look around for a library that could verify PDF documents. I had some experience with PDFBox in the past, so it seemed to be a good go-to solution. It had no way to verify documents by default, but it could open and parse them and that was enough to filter out the incorrect ones. It felt a little bit strange just to read the whole PDF into the memory to verify if it is correct or not, but hey I needed a simple fix for now and it worked really well.
- Best FOSS (ideally Docker) that can split PDF files ?
-
PDF processing and analysis with open-source tools
PDFBox can do this. It’s not part of the CLI but it wouldn’t be too hard to add:
https://github.com/apache/pdfbox/blob/5b00807463279f1002e245...
-
I am looking to automate a process at work...
You'll find libraries in most languages for parsing content out of PDF files, I did this most recently at work in Java using PDFBox.
-
Understanding PDF conversion
Something along these lines might be a better bet: https://pdfbox.apache.org/ - no idea if that's the best one, but you should be looking for modern, full-featured PDF manipulation libraries. GhostScript is definitely NOT that - it's an ancient PostScript interpreter that barely, barely supports PDFs.
-
Help Extracting Data from PDF Files
If you can program in Java then Apache PDFBox is an excellent very high quality library for reading (and writing) PDFs.
-
Wish there was a Java lib for…
Creating PDFs in an easy way. Currently using PDFBox and its kinda painful.
What are some alternatives?
iText - [DEPRECATED] Core Java Library + PDF/A, xtra and XML Worker. Only security fixes will be added — please use iText 7
docx4j - JAXB-based Java library for Word docx, Powerpoint pptx, and Excel xlsx files
Apache FOP - Apache XML Graphics FOP
OpenPDF - OpenPDF is a free Java library for creating and editing PDF files with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull-requests and bugreports to this GitHub repository. ⛺
flyingsaucer - XML/XHTML and CSS 2.1 renderer in pure Java
fastexcel - Generate and read big Excel files quickly
Dynamic Jasper - Dynamic Reports using Jasper Reports
documents4j - documents4j is a Java library for converting documents into another document format
Open HTML to PDF - An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!
easyexcel - 快速、简洁、解决大文件内存溢出的java处理Excel工具
boxable - Boxable is a library that can be used to easily create tables in pdf documents.
Konik - A library to create, read and validate ZUGFeRD compliant invoices. Available for Java and .NET