PDF.js | pandoc | |
---|---|---|
84 | 420 | |
46,332 | 32,449 | |
1.1% | - | |
9.9 | 9.8 | |
2 days ago | 3 days ago | |
JavaScript | Haskell | |
Apache License 2.0 | GNU General Public License v2.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
PDF.js
-
DEMO - Voice to PDF - Complete PDF documents with voice commands using the Claude 3 Opus API
readPdf: used for reading the dropped file and displaying it on the screen, it uses PDF.js to load the file, get all fields and display it on the browser.
-
Building W-9 Crafter
I first started building the app in the browser, using PDF.js and Download.js to take a PDF and edit it, and then download it to your computer.
-
Parsing PDFs in Node.js
pdf2json is a module that transforms PDF files from binary to JSON format, using pdf.js for its core functionality. It also incorporates support for interactive form elements, enhancing its utility in processing and interpreting PDF content.
-
Is it possible to port Edge's PDF Editor to other browsers or make your own custom one?
Why not PDF.js?
-
How to Write a Cold Email
I'd think opening a PDF in your browser would be at the same risk-level you associate with going to any random URL. On Firefox at least, I'm pretty sure the built-in PDF viewer is simply JS parsing and rendering the PDF anyway -- nothing with elevated permissions:
https://mozilla.github.io/pdf.js/
-
Firefox 119 unleashes PDF prowess and Sync sorcery
The PDF features are actually an extension, just one built in as Firefox's default pdf viewer.
It's called pdf.js https://github.com/mozilla/pdf.js/
You can actually use this pdf viewer in another browser like Chrome if you'd like, there's a demo URL on there.
-
PDF Chat with Node.js, OpenAI and ModelFusion
We use Mozilla's PDF.js via the pdfjs-dist NPM module to load pages from a PDF file. The loadPdfPages function reads the PDF file and extracts its content. It returns an array where each object contains the page number and the text of that page.
- Ask HN: Best toolkit to build custom pdf viewer?
-
Microsoft faces antitrust scrutiny from the EU over Teams, Office 365
The problem is that there simply wasn't a better option at the time.
Ogg Vorbis was a novelty at best, and it was the only decently widely adopted open source competitor for any of the items listed that was available at the time.
HTML5 was only just published when Chrome launched. So Flash was at that point the only option available to show a video in the browser (sure, downloading a RealPlayer file was always an option, but it was clunky, creators didn't like people being able to save stuff locally, and was also not open source). Chrome in fact arguably accelerated the process of getting web video open sourced: Google bought On2 in 2010 to get the rights to VP8 (the only decent H.264 competitor available at that point) so they could immediately open source it. The plan was in fact to remove H.264 from Chrome entirely once VP8/VP9 adoption ramped up[1], but that didn’t end up happening.
Flash was integrated into Chrome because people were going to use it anyway, and having Google distribute it at least let them both sandbox it and roll out automatic updates (a massive vector for malware at the time was ads pretending to be Flash updates, which worked because people were just that used to constant Flash security patches, most of which required a full reboot to apply; Chrome fixed both of those issues). Apple are the ones who ultimately dealt the death blow to Flash, and it was really just because Adobe could not optimize it for phone CPUs no matter what they tried (even the few Android releases of Flash that we got were practically unusable). That also further accelerated the adoption of open source HTML5 technologies.
PDF is an open source format, and has been since 2008. While I don't know if pressure from Google is what did it, that wouldn’t surprise me. Regardless, the Chrome PDF reader, PDFium, is open source[2] and Mozilla's equivalent project from 2011, PDF.js, is also open source.[3] Both of these projects replaced the distinctly closed source Adobe Reader plugin that was formerly mandatory for viewing PDFs in the browser.
Chrome is directly responsible for eliminating a lot of proprietary software from mainstream use and replacing it with high-quality open source tools. While they've caused problems in other areas of browser development that are worthy of criticism, Chrome's track record when it comes to open sourcing their tech has been very good.
[1]: https://blog.chromium.org/2011/01/html-video-codec-support-i...
[2]: https://github.com/chromium/pdfium
[3]: https://github.com/mozilla/pdf.js
-
How do Fix this issue while trying to save an edited PDF? (text gets really small and is rotated)(i'm using nightly)
Firefox Nightly is an unstable test version. You should report PDF issues to this GitHub repository.
pandoc
-
Beautifying Org Mode in Emacs (2018)
My main authoring tool is then Emacs Markdown Mode (https://jblevins.org/projects/markdown-mode/). For data entry, it comes with some bells and whistles similar to org-mode, like C-c C-l for inserting links etc.
I seldom export my notes for external usage, but if it is the case, I use lowdown (https://kristaps.bsd.lv/lowdown/) which also comes with some nice output targets (among the more unusual are Groff and Terminal). Of cource pandoc (https://pandoc.org/) does a very good job here, too.
-
Show HN: I made a tool to clean and convert any webpage to Markdown
This is one of those things that the ever-amazing pandoc (https://pandoc.org/) does very well, on top of supporting virtually every other document format.
-
LaTeX makes me so angry at word
Folks feel the same way about Markdown versus LaTeX: why use something significantly more complicated where a looser, human-readable grammar works better?
For any other situations, I use https://pandoc.org/, or, generate a Word doc scriptomatically.
-
📓 Versionner et builder l'eBook de son Entretien Annuel d'Evaluation sur Git(Hub)
pandoc toolchain pour builder une version confortable/imprimable en phase de travail (ePub, pdf, docx, html)
-
Launch HN: Onedoc (YC W24) – A better way to create PDFs
Congrats on the launch, I guess, but there are so many free options that I can't think of a situation where paying $0.25 per document would be justified...? Just to name a few:
Back in the days, I used to use XSL-FO [0] and it was okay. It was not very precise but it rarely if ever broke, and was perfectly integrated with an XML/XSLT solution. Yeah, this was a long time ago.
Last month I used html-to-pdfmake [1] and it's also not very precise and more fragile, but very efficient and fast.
Yet another approach would be to pro grammatically generate .rtf files (for example) and use Pandoc [2] to produce PDFs (I have not tried this in production but don't see why it wouldn't work).
[0] https://en.wikipedia.org/wiki/XSL_Formatting_Objects
[1] https://www.npmjs.com/package/html-to-pdfmake
[2] https://pandoc.org/
-
Ask HN: Looking for lightweight personal blogging platform
Others have mentioned static site generators. I like Hakyll [1] because it can tightly integrate with Pandoc [2] and allows you to develop custom solutions if your needs ever grow.
[1]: https://jaspervdj.be/hakyll/
[2]: https://pandoc.org/
-
Show HN: CLI for generating beautiful PDF for offline reading
Have you compared it with a conversion by pandoc (https://pandoc.org/)?
-
Pandoc
I have used it to kickstart a blogging project that I wish to come back to soon. The Lua inter-op for custom readers, writers and filters is great but I wish there was more editor integration and even perhaps an official IDE/editor with built-in debugging features (probably something already do-able with Emacs but I haven't checked). The only blocker for my project is no support for "ChunkedDoc" for Lua filters [1] which forces me to write more code and a complicated Makefile.
[1]: https://github.com/jgm/pandoc/issues/9061
- I don't always use LaTeX, but when I do, I compile to HTML (2013)
- What Happened to Pandoc-Discuss?
What are some alternatives?
jsPDF - Client-side JavaScript PDF generation for everyone.
pandoc-highlighting-extensions - Extensions to Pandoc syntax highlighting
pdfmake - Client/server side PDF printing in pure JavaScript
obsidian-html - :file_cabinet: A simple tool to convert an Obsidian vault into a static directory of HTML files.
PDFKit - A JavaScript PDF generation library for Node and the browser
obsidian-export - Rust library and CLI to export an Obsidian vault to regular Markdown
Papa Parse - Fast and powerful CSV (delimited text) parser that gracefully handles large files and malformed input
Obsidian-MD-To-PDF - A command line python script to convert Obsidian md files to a pdf
diff2html - Pretty diff to html javascript library (diff2html)
kramdown - kramdown is a fast, pure Ruby Markdown superset converter, using a strict syntax definition and supporting several common extensions.
pdf-lib - Create and modify PDF documents in any JavaScript environment
wavedrom - :ocean: Digital timing diagram rendering engine