documents4j
pandoc
documents4j | pandoc | |
---|---|---|
1 | 432 | |
558 | 34,922 | |
0.9% | - | |
0.0 | 9.8 | |
5 months ago | 8 days ago | |
Rich Text Format | Haskell | |
Apache License 2.0 | GNU General Public License v2.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
documents4j
-
Self-hosted API for converting MS Word documents to PDF
Take a look at https://documents4j.com. I'm not sure if it checks all your boxes. I have never set it up my self, but know it to be reliable in a production environment.
pandoc
-
NOTE: How to Install LaTeX and Pandoc on Ubuntu
wget https://github.com/jgm/pandoc/releases/download/3.5/pandoc-3.5-1-arm64.deb sudo dpkg -i pandoc-3.5-1-arm64.deb
-
The biggest blocker to LibreOffice adoption? LibreOffice
Send your publisher a .docx file made by converting a .md using Pandoc:
https://pandoc.org/
I used to work in publishing --- Word docs were hacked at by editors since it was their standard tool, then they would get flowed into PageMaker, or Quark XPress, or Ventura Publisher, or Adobe InDesign --- except of course for the LaTeX docs.
For .docx, the converted file is fine, so long as it has all the text and bold/italics which the author wants.
- MdBook – a command line tool to create books with Markdown
-
Top FP technologies
pandoc
-
WordPress Alternatives
I think yours is the candidate for Pandoc[1] or something like Soupault[2]. But you will be doing the HTML/CSS writing yourself.
1. https://pandoc.org
2. https://soupault.app
- John Carmack on Inlined Code
- From Gatsby gridlock to Astro bliss: my personal site redesign
-
Executable Blog Posts: Second Take
pandoc has a feature called filters. These filters are small programs that can manipulate the AST of the document. They can be written in any language, but the most common language is Lua as its interpreter is embedded in pandoc, and it is faster compared to the JSON filter interface which is also used by other languages (You may wish to listen to the Episode 37 of The Haskell Interlude podcast, where Joachim Breitner and David Christiansen interview John MacFarlane, the creator of pandoc, where he mentions Lua vs JSON filters).
- Why I Prefer RST to Markdown
- Ask HN: What are you using to parse PDFs for RAG?
What are some alternatives?
docx4j - JAXB-based Java library for Word docx, Powerpoint pptx, and Excel xlsx files
pandoc-highlighting-extensions - Extensions to Pandoc syntax highlighting
Apache POI - Mirror of Apache POI
obsidian-html - :file_cabinet: A simple tool to convert an Obsidian vault into a static directory of HTML files.
Aspose.Words-for-.NET - Aspose.Words for .NET examples, plugins and showcases
obsidian-export - Rust library and CLI to export an Obsidian vault to regular Markdown
OfficeToPDF - A command line tool to convert Microsoft Office documents to PDFs
Obsidian-MD-To-PDF - A command line python script to convert Obsidian md files to a pdf
gotenberg - A developer-friendly API for converting numerous document formats into PDF files, and more!
kramdown - kramdown is a fast, pure Ruby Markdown superset converter, using a strict syntax definition and supporting several common extensions.
wavedrom - :ocean: Digital timing diagram rendering engine
vimwiki - Personal Wiki for Vim