Java PDF

Open-source Java projects categorized as PDF

Top 21 Java PDF Projects

  • GitHub repo OpenPDF

    OpenPDF is a free Java library for creating and editing PDF files, released under the LGPL and MPL open-source licenses. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull requests and bug reports to this GitHub repository. ⛺

    Project mention: Can I use OpenPDF (unmodified) in my commercial application without distributing the source code of my app? | reddit.com/r/learnjava | 2021-05-27

    IANAL, but if it's this (https://github.com/LibrePDF/OpenPDF) it seems to be LGPL, which means you can use it with a closed source, commercial app. There are additional restrictions however, so it'd be wise to do some more thorough reading around the license and what it allows (and doesn't.)

  • GitHub repo pdfsam

    PDFsam, a desktop application to extract pages, split, merge, mix and rotate PDF files

    Project mention: Selfhosted PDF-Splitter/Joiner/Compressor? | reddit.com/r/selfhosted | 2021-11-29

  • GitHub repo flyingsaucer

    XML/XHTML and CSS 2.1 renderer in pure Java

    Project mention: Is Jasper still the best Report Engine? | reddit.com/r/java | 2021-07-02

    Personally, I'm a big fan of Flying Saucer.

  • GitHub repo grobid

    A machine learning software for extracting information from scholarly documents

    Project mention: Project to rebuild papers with plaintext markup languages | reddit.com/r/Open_Science | 2021-09-25

    - I ended up using Grobid, which converts the PDF to a very detailed XML format. It is not a word-processing format, though, but a format specifically for representing scientific documents. I don't know whether it would, for example, contain tags for bold or italicized text. The tool works really well, but since you probably cannot use the output XML directly, it will need some postprocessing, which should be relatively simple with XML parsing libraries.

  • GitHub repo Apache PDFBox

    Mirror of Apache PDFBox

    Project mention: Help Extracting Data from PDF Files | reddit.com/r/techsupport | 2021-11-18

    If you can program in Java, then Apache PDFBox is an excellent, very high-quality library for reading (and writing) PDFs.

  • GitHub repo iText

    [DEPRECATED] Core Java Library + PDF/A, xtra and XML Worker. Only security fixes will be added — please use iText 7

  • GitHub repo Tabula

    Extract tables from PDF files

    Project mention: Tabula – Extract tables from PDF files | reddit.com/r/patient_hackernews | 2021-06-08

  • GitHub repo Open HTML to PDF

    An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!

    Project mention: Is Jasper still the best Report Engine? | reddit.com/r/java | 2021-07-02

    There is also https://github.com/danfickle/openhtmltopdf/, a fork of Flying Saucer that has more features and is more actively developed :)

  • GitHub repo Images-to-PDF

    An app to convert images to PDF file!

    Project mention: A foss "PDF scanner", alternatives to CamScanner | reddit.com/r/fossdroid | 2020-12-30

    Check this out. https://github.com/Swati4star/Images-to-PDF

  • GitHub repo document-viewer

    Document Viewer is a highly customizable document viewer for Android.

    Project mention: Any FOSS fork of EbookDroid? | reddit.com/r/fossdroid | 2021-11-25

    https://github.com/SufficientlySecure/document-viewer but seems abandoned as well.

  • GitHub repo CERMINE

    Content ExtRactor and MINEr

    Project mention: Project to rebuild papers with plaintext markup languages | reddit.com/r/Open_Science | 2021-09-25

    - Another alternative that's on my list but that I didn't try is Cermine.

  • GitHub repo boxable

    Boxable is a library that can be used to easily create tables in pdf documents.

    Project mention: Is using ArrayList good practice for a regular software engineering job? | reddit.com/r/java | 2021-08-10
  • GitHub repo Dynamic Jasper

    Dynamic Reports using Jasper Reports

  • GitHub repo PDFCreatorAndroid

    Simple library to generate and view PDF in Android

  • GitHub repo Apache FOP

    Mirror of Apache FOP

  • GitHub repo PdfViewer

    Simple Android PDF viewer based on pdf.js and content providers. The app doesn't require any permissions. The PDF stream is fed into the sandboxed WebView without giving it access to content or files. CSP is used to enforce that the JavaScript and styling properties within the WebView are entirely static. (by GrapheneOS)

    Project mention: ⟳ 0 apps added, 4 updated at apt.izzysoft.de | reddit.com/r/FDroidUpdates | 2021-11-22

    PDF Viewer (version 7): Security focused PDF Viewer

  • GitHub repo CleanSCAN

    A simple, smart and efficient document scanner for Android

    Project mention: Open source version for scanning pages with your phone? | reddit.com/r/opensource | 2021-05-20

    Clean Scan: https://github.com/clean-apps/CleanSCAN/releases

  • GitHub repo Konik

    A library to create, read and validate ZUGFeRD compliant invoices. Available for Java and .NET

  • GitHub repo i7j-pdfsweep

    Remove content from your digital documents irretrievably instead of just covering it up. Redact text, images, parts of images or drawings for complete confidentiality. iText pdfSweep complies with GDPR for data redaction.

    Project mention: How do layers work in PDF? | reddit.com/r/pdf | 2021-03-22

    The behaviour you want is provided by redaction annotations. These allow you to keep working with the "full" version of the document, and apply the redactions to produce a version without the confidential information. It's a feature in the paid versions of Acrobat and Foxit's PDF viewer, but I'm not aware of any proper, open-source GUI-based tools that can handle that sort of thing. If you're willing to write some code, you can use this: https://github.com/itext/i7j-pdfsweep (it's free as long as you stick to the terms of the AGPL).

  • GitHub repo print

    Android print app (by billthefarmer)

    Project mention: ⟳ 5 apps added, 66 updated at f-droid.org | reddit.com/r/FDroidUpdates | 2021-08-08

    Print: Android print utility

  • GitHub repo Docx2PDF

    Converts Docx (based on OOXML) to PDF.

    Project mention: How to parse and map a Docx file with Java | dev.to | 2021-02-12

    You can also check out one of my other projects, where I create a pdf from a docx, using this approach: https://github.com/e-reznik/Docx2PDF.
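The Grobid mention above notes that its XML output needs some postprocessing with an XML parsing library. A minimal sketch of that step, using only the JDK's built-in DOM parser, might look like the following. The sample document, the class name `TeiParagraphs`, and the helper `textOf` are hypothetical and exist only for illustration; real Grobid output is much richer (and namespaced), so the tag lookup would need adjusting.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class TeiParagraphs {

    // Hypothetical, heavily simplified sample (assumption): not actual Grobid output.
    static final String SAMPLE =
        "<TEI><text><body>"
        + "<div><head>Introduction</head><p>First paragraph.</p></div>"
        + "<div><head>Methods</head><p>Second paragraph.</p></div>"
        + "</body></text></TEI>";

    // Returns the trimmed text content of every element with the given tag name,
    // in document order.
    static List<String> textOf(String xml, String tagName) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));

        NodeList nodes = doc.getElementsByTagName(tagName);
        List<String> result = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            result.add(nodes.item(i).getTextContent().trim());
        }
        return result;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(textOf(SAMPLE, "head")); // section headings
        System.out.println(textOf(SAMPLE, "p"));    // paragraph text
    }
}
```

From here, the extracted headings and paragraphs could be written out in whatever plaintext markup is wanted; that mapping is left out because it depends on the target format.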

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2021-11-29.

Index

What are some of the best open-source PDF projects in Java? This list will help you:

Rank  Project            Stars
1     OpenPDF            2,159
2     pdfsam             1,818
3     flyingsaucer       1,652
4     grobid             1,599
5     Apache PDFBox      1,542
6     iText              1,311
7     Tabula             1,297
8     Open HTML to PDF   1,297
9     Images-to-PDF        721
10    document-viewer      447
11    CERMINE              386
12    boxable              269
13    Dynamic Jasper       200
14    PDFCreatorAndroid    173
15    Apache FOP           134
16    PdfViewer            132
17    CleanSCAN            110
18    Konik                 38
19    i7j-pdfsweep          21
20    print                  8
21    Docx2PDF               2