Our great sponsors
|Apache POI||Apache PDFBox|
|3 days ago||3 days ago|
|-||Apache License 2.0|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
We haven't tracked posts mentioning Apache POI yet.
Tracking mentions began in Dec 2020.
How to crop, split, remove pages from PDFs with Java and PDFBox
2 projects | dev.to | 30 May 2023
Then, open the pdf_utils/pom.xml file and add a dependency to PDFBox, in the dependencies section:
Does no one use PDF files anymore?? In need of a PDF generator package...
2 projects | /r/react | 30 Mar 2023
Thoughts on Birt Report for pdf reports
2 projects | /r/java | 18 Jan 2023
How I archived 100 million PDF documents... (Part 1)
6 projects | dev.to | 11 Jan 2023
So, when I started to view the documents, a lot of them simply failed to open. I had to look around for a library that could verify PDF documents. I had some experience with PDFBox in the past, so it seemed to be a good go-to solution. It had no way to verify documents by default, but it could open and parse them and that was enough to filter out the incorrect ones. It felt a little bit strange just to read the whole PDF into the memory to verify if it is correct or not, but hey I needed a simple fix for now and it worked really well.
Best FOSS (ideally Docker) that can split PDF files ?
4 projects | /r/opensource | 29 Oct 2022
PDF processing and analysis with open-source tools
7 projects | news.ycombinator.com | 9 Oct 2022
PDFBox can do this. It’s not part of the CLI but it wouldn’t be too hard to add:
I am looking to automate a process at work...
2 projects | /r/programmer | 13 Sep 2022
You'll find libraries in most languages for parsing content out of PDF files, I did this most recently at work in Java using PDFBox.
Understanding PDF conversion
2 projects | /r/AskComputerScience | 25 Feb 2022
Something along these lines might be a better bet: https://pdfbox.apache.org/ - no idea if that's the best one, but you should be looking for modern, full-featured PDF manipulation libraries. GhostScript is definitely NOT that - it's an ancient PostScript interpreter that barely, barely supports PDFs.
Help Extracting Data from PDF Files
2 projects | /r/techsupport | 18 Nov 2021
If you can program in Java then Apache PDFBox is an excellent very high quality library for reading (and writing) PDFs.
Wish there was a Java lib for…
5 projects | /r/java | 26 May 2021
Creating PDFs in an easy way. Currently using PDFBox and its kinda painful.
What are some alternatives?
iText - [DEPRECATED] Core Java Library + PDF/A, xtra and XML Worker. Only security fixes will be added — please use iText 7
docx4j - JAXB-based Java library for Word docx, Powerpoint pptx, and Excel xlsx files
Apache FOP - Apache XML Graphics FOP
OpenPDF - OpenPDF is a free Java library for creating and editing PDF files with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull-requests and bugreports to this GitHub repository. ⛺
flyingsaucer - XML/XHTML and CSS 2.1 renderer in pure Java
fastexcel - Generate and read big Excel files quickly
Dynamic Jasper - Dynamic Reports using Jasper Reports
documents4j - documents4j is a Java library for converting documents into another document format
Open HTML to PDF - An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!
easyexcel - 快速、简洁、解决大文件内存溢出的java处理Excel工具
boxable - Boxable is a library that can be used to easily create tables in pdf documents.
Konik - A library to create, read and validate ZUGFeRD compliant invoices. Available for Java and .NET