Top 23 PDF Open-Source Projects

  • Awesome-CV

    :page_facing_up: Awesome CV is LaTeX template for your outstanding job application

    Project mention: IT Support --> Entwickler, keine Programmiererfahrung. Könntet ihr mein Lebenslauf bewerten und einpaar Ratschläge geben? | reddit.com/r/arbeitsleben | 2023-04-21
  • best-resume-ever

    :necktie: :briefcase: Build fast :rocket: and easy multiple beautiful resumes and create your best CV ever! Made with Vue and LESS.

    Project mention: Here's what a job search looks like, from someone who was mostly clueless in their first years of work | reddit.com/r/cscareerquestions | 2022-08-23

    On the topic of ATS rejecting resumes, could it be a problem if it was generated as a PDF from a web page? I use this resume generating framework specifically.

  • Appwrite

    Appwrite - The Open Source Firebase alternative introduces iOS support . Appwrite is an open source backend server that helps you build native iOS applications much faster with realtime APIs for authentication, databases, files storage, cloud functions and much more!

  • Etherpad

    Etherpad: A modern really-real-time collaborative document editor.

    Project mention: What Tool Do You Use For Collaborating On Tasks? | reddit.com/r/PrivacyGuides | 2023-03-26

    Did you try https://etherpad.org ?

  • koreader

    An ebook reader application supporting PDF, DjVu, EPUB, FB2 and many more formats, running on Cervantes, Kindle, Kobo, PocketBook and Android devices

    Project mention: Kindle Touch - Reuse as an always-on agenda | reddit.com/r/opensource | 2023-05-29

    Yes! Actually there's a lot of support for jailbreaking older Kindles out there. I use KOReader on mine, which natively supports RSS/Atom (as well as Wallabag).

  • react-pdf

    📄 Create PDF files using React

    Project mention: 🎉Top React Libraries for Developers | dev.to | 2023-05-18

    React PDF is a JS library for generating PDF documents in React apps. It provides a set of React components that can be used to create PDFs, making it easier to generate dynamic and data-driven PDF files.

  • mit-deep-learning-book-pdf

    MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

    Project mention: Is supervised machine learning the same as linear regression? | reddit.com/r/learnmachinelearning | 2023-03-07
  • gpt4-pdf-chatbot-langchain

    GPT4 & LangChain Chatbot for large PDF docs

    Project mention: Chat with Documents using Open source LLMs | reddit.com/r/LangChain | 2023-05-27

    https://github.com/mayooear/gpt4-pdf-chatbot-langchain this repo uses gpt-3.5/4 which uses OpenAI API. Is there any work donw with free/open-source LLMs

  • Sonar

    Write Clean JavaScript Code. Always.. Sonar helps you commit clean code every time. With over 300 unique rules to find JavaScript bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.

  • koodo-reader

    A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web

    Project mention: Is there anything that can replace Calibre? | reddit.com/r/selfhosted | 2022-12-08

    My one hope as of now is: https://github.com/troyeguo/koodo-reader

  • sumatrapdf

    SumatraPDF reader

    Project mention: FSF Slams Google over Dropping JPEG-XL in Chrome | news.ycombinator.com | 2023-04-16

    FWIW, https://github.com/sumatrapdfreader/sumatrapdf/issues/1249 ("Support form filling for at least 1040 irs form") has a comment saying "MuPDF-GL has the capability to edit fields and save the PDF" of a 1040.

    I have just learned the Firefox 93 added support XFA - https://techdows.com/2021/10/open-xfa-pdfs-in-firefox.html .

    So it would appear there are free software solutions to XFA forms.

    Just because something doesn't seem far-fetched to you, doesn't mean most people will regard it as far-fetched.

    Many people all sorts of "sectarian objections" - far more than there are SovCits or other tax protesters. Stallman has never come across as a tax protester. Ergo, I think it's far-fetched that "sectarian objections" is strongly associated with tax protests.

    Further, at https://stallman.org/archives/2017-may-aug.html we can read Stallman opinine that we need to "return to the "bad old days", when Americans in general could have a decent life, not penury; when the US could afford to build what the public needed instead of privatizing everything with a toll" by making taxation more progressive. At https://stallman.org/archives/2011-jan-apr.html we read he supports "The Fairness in Taxation Act [which] would raise taxes to 45% on incomes over a million dollars a year."

  • milewski-ctfp-pdf

    Bartosz Milewski's 'Category Theory for Programmers' unofficial PDF and LaTeX source

    Project mention: Reading recomendations on Category Theory | reddit.com/r/math | 2023-04-25

    Milewski's "Category Theory for Programmers".

  • Dompdf

    HTML to PDF converter for PHP

    Project mention: Modern PDF library (no html) | reddit.com/r/PHP | 2023-03-08

    A middle ground is dompdf, an HTML to PDF converter made in PHP. It isn't full HTML/CSS compatible, but is good enough in most cases.

  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

    Project mention: Donut: OCR-Free Document Understanding Transformer | news.ycombinator.com | 2023-05-29
  • xournalpp

    Xournal++ is a handwriting notetaking software with PDF annotation support. Written in C++ with GTK3, supporting Linux (e.g. Ubuntu, Debian, Arch, SUSE), macOS and Windows 10. Supports pen input from devices such as Wacom Tablets.

    Project mention: Is it possible to use checkboxes on a pre-typed document? | reddit.com/r/techsupport | 2023-05-22

    You can use Xournal++ to write text or draw over it.

  • Zettlr

    A Markdown Editor for the 21st century.

    Project mention: I read the top ten Zettelkasten articles on Hacker News so you can do something more wholesome with your day | reddit.com/r/Zettelkasten | 2023-05-21

    GitLab software engineer Tomas Vik runs through the slip-box method, based on Sönke Ahrens's book, How to Take Smart Notes. He recommends creating individual plain text (markdown) files and gives clear examples of how this is structured. He used Zettlr as his markdown-enabled text editor of choice, but mentions alternative apps that do similar things. As a bonus, there's a follow-up post a year later, in which the author describes how his process has changed (not much) and why he now uses Logseq instead of Zettlr.

  • libvips

    A fast image processing library with low memory needs.

    Project mention: Go Image Converting | reddit.com/r/golang | 2023-03-09

    h2non/bimg can handle both if the underlying libvips is compiled with support for both formats.

  • react-pdf

    Display PDFs in your React app as easily as if they were images. (by wojtekmaj)

    Project mention: react-pdf not rendering PDF | reddit.com/r/reactjs | 2023-05-26

    I am trying to load a PDF from my src/assets folder, following the example from here I have the following code:

  • url-to-pdf-api

    Web page PDF/PNG rendering done right. Self-hosted service for rendering receipts, invoices, or any content.

  • QuestPDF

    QuestPDF is a modern open-source .NET library for PDF document generation. Offering comprehensive layout engine powered by concise and discoverable C# Fluent API. Easily generate PDF reports, invoices, exports, etc.

    Project mention: How to generate PDFs in react? | reddit.com/r/reactjs | 2023-05-17

    I used that same library it worked great the only issue I had was the users would often have to manually set the scaling to fit to a page. I'm sure I could've fixed this in other ways if I was more competent with CSS but ended up just switching to use https://github.com/QuestPDF/QuestPDF in a backend instead of doing everything in front end.

  • PHPWord

    A pure PHP library for reading and writing word processing documents

    Project mention: Reading accents in doc files | reddit.com/r/PHPhelp | 2023-01-27

    If the files are in UTF-8 there shouldn't be encoding issues, but I found this encoding issue from 2017 in the github for PHPWord: https://github.com/PHPOffice/PHPWord/issues/1172 which may help.

  • quivr

    Dump all your files and thoughts into your GenerativeAI Second Brain and chat with it

    Project mention: Quivr – Your Second Brain, Empowered by Generative AI | news.ycombinator.com | 2023-05-29
  • tabula

    Tabula is a tool for liberating data tables trapped inside PDF files

    Project mention: Automatisches Auslesen von PDFs | reddit.com/r/de_EDV | 2023-05-16
  • laravel-dompdf

    A DOMPDF Wrapper for Laravel

    Project mention: Creating PDF Files in Laravel 9: A Step-by-Step Guide using DomPDF | dev.to | 2023-01-27

    In conclusion, creating PDF files in Laravel 9 is a simple task that can be accomplished using the laravel-dompdf package. By following the steps outlined in this tutorial, you can easily generate PDFs for your web application. This can be a great feature for creating invoices, reports, and other types of documents, with the added benefit of being SEO optimized.

  • WeasyPrint

    The awesome document factory

    Project mention: Library to convert HTML to pdf in Golang | reddit.com/r/golang | 2023-05-22

    In a recent project I used https://github.com/Kozea/WeasyPrint/ it is written in python, so you will need to use it like so:


    ONLYOFFICE Docs — document collaboration in your environment. Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-05-29.

PDF related posts


What are some of the best open-source PDF projects? This list will help you:

Project Stars
1 Awesome-CV 19,235
2 best-resume-ever 15,975
3 Etherpad 14,300
4 koreader 12,862
5 react-pdf 12,329
6 mit-deep-learning-book-pdf 11,230
7 gpt4-pdf-chatbot-langchain 10,858
8 koodo-reader 10,613
9 sumatrapdf 10,322
10 milewski-ctfp-pdf 10,201
11 Dompdf 9,680
12 OCRmyPDF 8,920
13 xournalpp 8,562
14 Zettlr 8,473
15 libvips 7,977
16 react-pdf 7,145
17 url-to-pdf-api 6,818
18 QuestPDF 6,749
19 PHPWord 6,690
20 quivr 6,273
21 tabula 6,102
22 laravel-dompdf 5,882
23 WeasyPrint 5,833
TestGPT | Generating meaningful tests for busy devs
Get non-trivial tests (and trivial, too!) suggested right inside your IDE, so you can code smart, create more value, and stay confident when you push.