Apache PDFBox Alternatives
Similar projects and alternatives to Apache PDFBox
[DEPRECATED] Core Java Library + PDF/A, xtra and XML Worker. Only security fixes will be added — please use iText 7
Apache XML Graphics FOP
OpenPDF is a free Java library for creating and editing PDF files with an LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull requests and bug reports to this GitHub repository. ⛺
XML/XHTML and CSS 2.1 renderer in pure Java
Dynamic Reports using Jasper Reports
Mirror of Apache POI
Open HTML to PDF
An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!
Boxable is a library that can be used to easily create tables in pdf documents.
PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages
mirrored from git://git.ghostscript.com/mupdf.git (by ccxvii)
A library to create, read and validate ZUGFeRD compliant invoices. Available for Java and .NET
Tesseract Open Source OCR Engine (main repository)
Universal markup converter
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.
Tabula is a tool for liberating data tables trapped inside PDF files
The awesome document factory
Mirror of Apache HTTP Server. Issues: http://issues.apache.org
Apache PDFBox reviews and mentions
How to crop, split, remove pages from PDFs with Java and PDFBox
2 projects | dev.to | 30 May 2023
Then, open the pdf_utils/pom.xml file and add a dependency to PDFBox, in the dependencies section:
Does no one use PDF files anymore?? In need of a PDF generator package...
2 projects | /r/react | 30 Mar 2023
Thoughts on Birt Report for pdf reports
2 projects | /r/java | 18 Jan 2023
How I archived 100 million PDF documents... (Part 1)
6 projects | dev.to | 11 Jan 2023
So, when I started to view the documents, a lot of them simply failed to open. I had to look around for a library that could verify PDF documents. I had some experience with PDFBox in the past, so it seemed to be a good go-to solution. It had no way to verify documents by default, but it could open and parse them, and that was enough to filter out the incorrect ones. It felt a little strange to read the whole PDF into memory just to verify whether it is correct, but hey, I needed a simple fix for now and it worked really well.
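The concern above about reading a whole PDF into memory just to check it can be illustrated with a cheap pre-check. The sketch below is not the author's method and does not use PDFBox: it is a swapped-in, standard-library-only heuristic that reads a few header bytes and the last 1 KiB of the file, looking for the `%PDF-` header and the `%%EOF` trailer marker. The class name `PdfQuickCheck`, the method `looksLikePdf`, and the 1 KiB tail window are assumptions made for illustration. Passing this check does not prove a file is valid; it only rejects obviously broken or truncated files before a full parse.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper (not part of PDFBox): a cheap structural pre-check for PDF files. */
final class PdfQuickCheck {

    // Assumption: the %%EOF marker sits within the last 1 KiB of the file.
    private static final int TAIL_WINDOW = 1024;

    /** Returns true if head starts with "%PDF-" and tail contains "%%EOF". */
    static boolean looksLikePdf(byte[] head, byte[] tail) {
        // ISO-8859-1 maps every byte to one char, so binary data cannot break decoding.
        String h = new String(head, StandardCharsets.ISO_8859_1);
        String t = new String(tail, StandardCharsets.ISO_8859_1);
        return h.startsWith("%PDF-") && t.contains("%%EOF");
    }

    /** Reads only the first 8 bytes and the last TAIL_WINDOW bytes of the file. */
    static boolean looksLikePdf(String path) throws IOException {
        try (RandomAccessFile file = new RandomAccessFile(path, "r")) {
            long length = file.length();

            byte[] head = new byte[(int) Math.min(8, length)];
            file.readFully(head);

            int tailSize = (int) Math.min(TAIL_WINDOW, length);
            byte[] tail = new byte[tailSize];
            file.seek(length - tailSize);
            file.readFully(tail);

            return looksLikePdf(head, tail);
        }
    }

    /** Prints a verdict for each file path given on the command line. */
    public static void main(String[] args) throws IOException {
        for (String path : args) {
            String verdict = looksLikePdf(path) ? "plausible PDF" : "rejected";
            System.out.println(path + ": " + verdict);
        }
    }
}
```

Files that pass a filter like this would still need a real parser, such as the PDFBox approach described above, to catch damage inside the document body.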
Best FOSS (ideally Docker) that can split PDF files ?
4 projects | /r/opensource | 29 Oct 2022
PDF processing and analysis with open-source tools
7 projects | news.ycombinator.com | 9 Oct 2022
PDFBox can do this. It’s not part of the CLI but it wouldn’t be too hard to add:
I am looking to automate a process at work...
2 projects | /r/programmer | 13 Sep 2022
You'll find libraries in most languages for parsing content out of PDF files, I did this most recently at work in Java using PDFBox.
Understanding PDF conversion
2 projects | /r/AskComputerScience | 25 Feb 2022
Something along these lines might be a better bet: https://pdfbox.apache.org/ - no idea if that's the best one, but you should be looking for modern, full-featured PDF manipulation libraries. GhostScript is definitely NOT that - it's an ancient PostScript interpreter that barely, barely supports PDFs.
Help Extracting Data from PDF Files
2 projects | /r/techsupport | 18 Nov 2021
If you can program in Java, then Apache PDFBox is an excellent, very high-quality library for reading (and writing) PDFs.
Wish there was a Java lib for…
5 projects | /r/java | 26 May 2021
Creating PDFs in an easy way. Currently using PDFBox and it's kinda painful.
apache/pdfbox is an open source project licensed under the Apache License 2.0, which is an OSI-approved license.
The primary programming language of Apache PDFBox is Java.