SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 JavaScript PDF Projects
-
koodo-reader
A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
Project mention: Foliate: Read e-books in style, navigate with ease | news.ycombinator.com | 2024-07-18https://github.com/koodo-reader/koodo-reader/commits/master/
I'm really not confortable with projects that treat the Git history as junk, to the point I'd rather avoid that software. I daily use the e-reader Koreader, and sometimes Plato, and their source code and Git history are clear and documented (though I think Koreader's choice of Lua is poorly suited to the task). I could patch Koreader to my taste and send pull requests.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Official React PDF Renderer Docs
-
Project mention: Show HN: Documind – Open-source AI tool to turn documents into structured data | news.ycombinator.com | 2024-11-18
From the source, Documind appears to:
1) Install tools like Ghostscript, GraphicsMagick, and LibreOffice with a JS script. 2) Convert document pages to Base64 PNGs and send them to OpenAI for data extraction. 3) Use Supabase for unclear reasons.
Some issues with this approach:
* OpenAI may retain and use your data for training, raising privacy concerns [1].
* Dependencies should be managed with Docker or package managers like Nix or Pixi, which are more robust. Example: a tool like Parsr [2] provides a Dockerized pdf-to-json solution, complete with OCR support and an HTTP api.
* GPT-4 vision seems like a costly, error-prone, and unreliable solution, not really suited for extracting data from sensitive docs like invoices, without review.
* Traditional methods (PDF parsers with OCR support) are cheaper, more reliable, and avoid retention risks for this particular use case. Although these tools do require some plumbing... probably LLMs can really help with that!
While there are plenty of tools for structured data extraction, I think there’s still room for a streamlined, all-in-one solution. This gap likely explains the abundance of closed-source commercial options tackling this very challenge.
---
1: https://platform.openai.com/docs/models#how-we-use-your-data
2: https://github.com/axa-group/Parsr
-
Project mention: Show HN: Epublifier – scrape pages (books, manuals) for offline reading | news.ycombinator.com | 2024-10-21
For those interested in a simple to use command line tool that accomplishes the same I've had success with percollate - https://github.com/danburzo/percollate
-
node-html-pdf
This repo isn't maintained anymore as phantomjs got dreprecated a long time ago. Please migrate to headless chrome/puppeteer.
node-html-pdf: the library is simple and easy to use, one simple command and you get your PDF. However, the library is no longer maintained and we expect to have issue with complex layouts. Also, even if the HTML document include already all information about the layout, you may have to add some optionto make it work correctly as shown in the .README. So while being simple, it requires manual fintuning for each document.
-
pdf-bot
🤖 A Node queue API for generating PDFs using headless Chrome. Comes with a CLI, S3 storage and webhooks for notifying subscribers about generated PDFs
-
-
-
-
-
-
-
-
pdfjs
A Portable Document Format (PDF) generation library targeting both the server- and client-side.
-
-
I've been test-driving the web pdf build tool for Asciidoc, asciidoctor-web-pdf[1], for a few years, which uses Paged.js as the template engine before CSS PMM has its go. I like it - I like it a LOT[2] - but Puppeteer-Chrome bugs breaks the build on the regular, or requires a rework of templates. So the web-pdf team started just releasing docker images that include a tested Chromium version (among other things), so as to keep that from being such a PITA. Which is fine. Howaaaayyyyyyyver . . that shines a spotlight on a problem with this workflow: the dependency on browser rendering kit.
[1] https://github.com/ggrossetie/asciidoctor-web-pdf
-
chai
chai - Experience Zero Trust security with Chai! Convert and view documents as vivid images right in your browser. No mandatory downloads, no hassle—just pure, joyful security! 🌈 (by dosyago)
But not just off-topic: abusive, and dishonest.
I'm not sure this even applies as we call the mutool binary installed via apt, rather than use or modify their libraries.
Even if it applies, Mu's AGPL requires you release the source code, which is what we already and have always done. So it doesn't apply. It doesn't require you use a particular license.
As you're so keen on searching our source you could have also easily read what the AGPL means, and seen that we use mutool^0, which I guess you would have done, if you were actually intending to be helpful rather than just trying to make us look bad, right? Hahaha! :)
I guess you're one of those people bitter at our success or maybe you were trying to use BrowserBox without paying the licensing fees and you didn't like that we made it commercial, is that right? Hahaha! :)
It seems if you were genuinely trying to be helpful rather than dishonest and trying to make us look bad, you would have just emailed me, right? Hahahahaha! :)
https://github.com/dosyago/chai/blob/37c1a1ec0941d81e0d6f8af...
-
-
-
jspdf-invoice-template
PDF template created to generate invoices based on props object. Using jsPDF library.
-
twindle
Twindle - an open source project for beginners. Converting twitter threads to pdf, epub, and mobi format to be read by Kindle.
-
JavaScript PDF discussion
JavaScript PDF related posts
-
How to Create PDFs using React JS
-
Creating a Remix Server to Return a React Page as a PDF
-
Show HN: Open-source Reactjs lib for create PDF files
-
TOP React Liabraries and tools you Need
-
DEMO - Voice to PDF - Complete PDF documents with voice commands using the Claude 3 Opus API
-
How we improved our client-side PDF generation by 5x
-
Ask HN: Can 2 ledgers generate the same seed phrase out of millions out there?
-
A note from our sponsor - SaaSHub
www.saashub.com | 13 Jan 2025
Index
What are some of the best open-source PDF projects in JavaScript? This list will help you:
Project | Stars | |
---|---|---|
1 | koodo-reader | 20,345 |
2 | react-pdf | 15,153 |
3 | Parsr | 5,860 |
4 | percollate | 4,350 |
5 | node-html-pdf | 3,558 |
6 | pdf-bot | 2,629 |
7 | decktape | 2,215 |
8 | universal-resume | 1,741 |
9 | obsidian-annotator | 1,535 |
10 | md2pdf | 1,478 |
11 | Buka | 1,089 |
12 | vscode-markdown-pdf | 1,025 |
13 | OpenComic | 997 |
14 | pdfjs | 785 |
15 | libresign | 500 |
16 | asciidoctor-web-pdf | 458 |
17 | chai | 364 |
18 | starter-book | 339 |
19 | pdf-annotate.js | 270 |
20 | chrome-headless-render-pdf | 225 |
21 | jspdf-invoice-template | 148 |
22 | twindle | 132 |
23 | bookworm | 137 |