Java PDF

Open-source Java projects categorized as PDF

Top 23 Java PDF Projects

  1. Stirling-PDF

    #1 Locally hosted web application that allows you to perform various operations on PDF files

    Project mention: A free, unlimited online PDF converter with Privacy focus | news.ycombinator.com | 2025-01-03

    Congrats on the launch, it is interesting. Do you have plans to open-source the project?

    I'm a happy user of Stirling-PDF [1], which covers all my PDF needs. I host it on my own network, not accessible from the internet, for better privacy.

    [1] https://github.com/Stirling-Tools/Stirling-PDF

  3. mit-deep-learning-book-pdf

    MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

    Project mention: Top Github repositories for 10+ programming languages | dev.to | 2024-07-16

    MIT deep learning PDF

  4. grobid

    Machine learning software for extracting information from scholarly documents

    Project mention: Starting July 1, Academic Publishers Can't Paywall NIH-Funded Research | news.ycombinator.com | 2025-05-01

    What do you mean exactly? I was surprised that, with grobid, many papers (at least those on arXiv) are easily converted to XML, which is better for processing than PDF.

    Most of the papers are built from their LaTeX sources, so there's an easy way to undo it, I guess.

    https://github.com/kermitt2/grobid

  5. OpenPDF

    OpenPDF is a free Java library for creating and editing PDF files, with an LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull requests and bug reports to this GitHub repository.

    Project mention: iText PDF Library turns 25 | news.ycombinator.com | 2025-02-20

    OpenPDF is a fork of iText under the LGPL and MPL licenses: https://github.com/LibrePDF/OpenPDF

  6. pdfsam

    PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages

    Project mention: JavaFX In Action with Pedro Duque Vieira, aka Duke, about Hero, PDFSam, FXThemes, FXComponents,... | dev.to | 2024-06-25

    He shares insights into a few applications, like Hero (a CAD application for calculating energy efficiency) and PDFSam (a powerful, professional PDF editor). PDFSam had 100,000 downloads in April '24!

  7. Apache PDFBox

    Mirror of Apache PDFBox

    Project mention: How to Read Thread Dumps – easily & efficiently | dev.to | 2024-12-04

    Case Study: In a real-world incident, an application experienced a deadlock due to a bug in the Apache PDFBox library. The problem arose when two threads acquired locks in opposite orders, resulting in a deadlock that caused the application to hang. To learn more about this case and how the deadlock was resolved, check out Troubleshooting Deadlock in an Apache Open-Source Library.

  8. flyingsaucer

    XML/XHTML and CSS 2.1 renderer in pure Java

    Project mention: Home to Anything JavaFX Related | news.ycombinator.com | 2024-08-17
  10. itext-java

    iText for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText can be a boon to nearly every workflow.

    Project mention: How I Made PDF Table Rendering 95% Faster in an Afternoon | news.ycombinator.com | 2025-05-16
  11. pdf2json

    Converts binary PDF to JSON and text, for server-side PDF processing and command-line use.

  12. Open HTML to PDF

    An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!

  13. Tabula

    Extract tables from PDF files

  14. iText

    [DEPRECATED] Core Java Library + PDF/A, xtra and XML Worker. Only security fixes will be added — please use iText 7

    Project mention: Adobe’s Monopoly on PDFs: The Innovation Bottleneck and the Need for Open-Source Alternatives | dev.to | 2025-03-01

    Some open-source PDF tools, like iText and Ghostscript, have restrictive licenses, making them impractical for commercial use.

  15. Images-to-PDF

    An app to convert images to a PDF file!

  16. jasperreports

    JasperReports® - Free Java Reporting Library

    Project mention: A basic introduction to the JasperReports library | dev.to | 2024-12-20
  17. PdfViewer

    Simple Android PDF viewer based on pdf.js and content providers. The app doesn't require any permissions. The PDF stream is fed into the sandboxed WebView without giving it access to content or files. CSP is used to enforce that the JavaScript and styling properties within the WebView are entirely static.

  18. document-viewer

    Document Viewer is a highly customizable document viewer for Android.

  19. CERMINE

    Content ExtRactor and MINEr

    Project mention: Show HN: Kreuzberg – Modern async Python library for document text extraction | news.ycombinator.com | 2025-02-15
  20. boxable

    Boxable is a library that can be used to easily create tables in PDF documents.

  21. email-to-pdf-converter

    Converts email files (eml, msg) to PDF

  22. react-native-PDFView

    📚 PDF viewer for React Native

  23. PDFCreatorAndroid

    Simple library to generate and view PDFs on Android

  24. Dynamic Jasper

    Dynamic Reports using Jasper Reports

  25. pdfcompare

    A simple Java library to compare two PDF files

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Java PDF related posts

  • How I Made PDF Table Rendering 95% Faster in an Afternoon

    1 project | news.ycombinator.com | 16 May 2025
  • Starting July 1, Academic Publishers Can't Paywall NIH-Funded Research

    1 project | news.ycombinator.com | 1 May 2025
  • Adobe’s Monopoly on PDFs: The Innovation Bottleneck and the Need for Open-Source Alternatives

    2 projects | dev.to | 1 Mar 2025
  • iText PDF Library turns 25

    2 projects | news.ycombinator.com | 20 Feb 2025
  • A free, unlimited online PDF converter with Privacy focus

    2 projects | news.ycombinator.com | 3 Jan 2025
  • How to Read Thread Dumps – easily & efficiently

    1 project | dev.to | 4 Dec 2024
  • Stirling-PDF: Your All-in-One PDF Solution

    1 project | dev.to | 11 Sep 2024

Index

What are some of the best open-source PDF projects in Java? This list will help you:

# Project Stars
1 Stirling-PDF 59,802
2 mit-deep-learning-book-pdf 13,247
3 grobid 4,042
4 OpenPDF 3,864
5 pdfsam 3,790
6 Apache PDFBox 2,825
7 flyingsaucer 2,103
8 itext-java 2,100
9 pdf2json 2,094
10 Open HTML to PDF 2,018
11 Tabula 1,929
12 iText 1,645
13 Images-to-PDF 1,256
14 jasperreports 1,156
15 PdfViewer 724
16 document-viewer 529
17 CERMINE 493
18 boxable 338
19 email-to-pdf-converter 318
20 react-native-PDFView 302
21 PDFCreatorAndroid 278
22 Dynamic Jasper 249
23 pdfcompare 239

Did you know that Java is the 8th most popular programming language based on number of references?