|1 day ago||14 days ago|
|Apache License 2.0||Apache License 2.0|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
So you want to modify the text of a PDF by hand
3 projects | news.ycombinator.com | 4 Sep 2023
If you don't mind using java, you can use the open source Apache PDFBox library
It's relatively performant and it's a mature and supported codebase that can accomplish most pdf tasks.
best pdf library to use in 2023?
2 projects | /r/learnjava | 27 Jun 2023
How to crop, split, remove pages from PDFs with Java and PDFBox
2 projects | dev.to | 30 May 2023
Then, open the pdf_utils/pom.xml file and add a dependency to PDFBox, in the dependencies section:
Does no one use PDF files anymore?? In need of a PDF generator package...
2 projects | /r/react | 30 Mar 2023
Thoughts on Birt Report for pdf reports
2 projects | /r/java | 18 Jan 2023
How I archived 100 million PDF documents... (Part 1)
6 projects | dev.to | 11 Jan 2023
So, when I started to view the documents, a lot of them simply failed to open. I had to look around for a library that could verify PDF documents. I had some experience with PDFBox in the past, so it seemed to be a good go-to solution. It had no way to verify documents by default, but it could open and parse them and that was enough to filter out the incorrect ones. It felt a little bit strange just to read the whole PDF into the memory to verify if it is correct or not, but hey I needed a simple fix for now and it worked really well.
Best FOSS (ideally Docker) that can split PDF files ?
4 projects | /r/opensource | 29 Oct 2022
PDF processing and analysis with open-source tools
7 projects | news.ycombinator.com | 9 Oct 2022
PDFBox can do this. It’s not part of the CLI but it wouldn’t be too hard to add:
I am looking to automate a process at work...
2 projects | /r/programmer | 13 Sep 2022
You'll find libraries in most languages for parsing content out of PDF files, I did this most recently at work in Java using PDFBox.
Understanding PDF conversion
2 projects | /r/AskComputerScience | 25 Feb 2022
Something along these lines might be a better bet: https://pdfbox.apache.org/ - no idea if that's the best one, but you should be looking for modern, full-featured PDF manipulation libraries. GhostScript is definitely NOT that - it's an ancient PostScript interpreter that barely, barely supports PDFs.
Is using ArrayList good practice for a regular software engineering job?
2 projects | /r/java | 10 Aug 2021
What are some alternatives?
iText - [DEPRECATED] Core Java Library + PDF/A, xtra and XML Worker. Only security fixes will be added — please use iText 7
Apache FOP - Apache XML Graphics FOP
OpenPDF - OpenPDF is a free Java library for creating and editing PDF files with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull-requests and bugreports to this GitHub repository. ⛺
flyingsaucer - XML/XHTML and CSS 2.1 renderer in pure Java
Apache POI - Mirror of Apache POI
Dynamic Jasper - Dynamic Reports using Jasper Reports
Open HTML to PDF - An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!
Konik - A library to create, read and validate ZUGFeRD compliant invoices. Available for Java and .NET
mupdf - mirrored from git://git.ghostscript.com/mupdf.git
itext7 - iText for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText can be a boon to nearly every workflow.
PdfPig - Read and extract text and other content from PDFs in C# (port of PDFBox)