Html-to-markdown Alternatives
Similar projects and alternatives to html-to-markdown
InfluxDB
InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
flexmark-java
CommonMark/Markdown Java parser with source level AST. CommonMark 0.28, emulation of: pegdown, kramdown, markdown.pl, MultiMarkdown. With HTML to MD, MD to PDF, MD to DOCX conversion modules.
pagser
Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawler
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
html-to-markdown discussion
html-to-markdown reviews and mentions
-
Show HN: We made an MCP Server so that Cursor can build anything from API Docs
I'm frequently constructing context based on up-to-date docs using curl + html2markdown[0] and custom css selectors, which is extremely tedious. MCP servers for docs would be very useful for me.
That said, I don't really expect the AI itself to come up with docs to read (maybe some day). I want it predominantly so I can manually reference it in my prompt (in e.g. the Zed assistant panel) like `/npmdocs packagename packageversion`.
But even for AI "self-driven" use-cases, I primarily see the value in read-only MCP servers that provide more context, just in an "as-needed" way, instead of me putting it there explicitly.
[0]: https://github.com/JohannesKaufmann/html-to-markdown
-
AIM Weekly for 11/11/2024
🌐 All this buildings 📎 Synthetic Data Generator - local on Ollama! 💻 Agentic Mesh Future! 🦾 Convert an entire HTML Website to Markdown with Great Go CLI Tool 🫶 LiteLLM 📝 How to set up InstructLab Granite Model 🌐 Fundamentals of Platform Transformation Salesforce 🌐 Graphrag Explained 🌐 Matryoshka Embedding Detail at Multiple Scales 🌐 Browser Use for LLM 📎 Creating Advanced AI Agents with Ollama and Langchain 💻 Great command line REST Client 🤖 Inkeep Builds an AI Assistant with Milvus ✅ Visual Data Transformation ETL 💻 What is a LongRAG 🛠️ PymuPDF4llm is Your New Best Friend 🫶 WebScraping for LLM
-
Show HN: HTML-to-Markdown – convert entire websites to Markdown with Golang/CLI
Yeah good point, that's actually difficult. They use many `<span>` html tags to color individual words and syntax.
But I wrote logic to handle that. It probably needs to be adapted at some point, but works surprisingly well. Have a look at the testdata files ("code.in.html" and "code.out.md" files [1]).
Feel free to give it a try & let me know if you notice any edge cases!
[1] https://github.com/JohannesKaufmann/html-to-markdown/blob/ma...
-
Htmd: A turndown.js inspired HTML-to-Markdown converter for Rust
Note: Six years ago I open sourced a Golang library [1]. I am currently rewriting it completely, with the aim of making it even better than Pandoc, and I wrote about the edge cases I encountered [2].
[1] https://github.com/JohannesKaufmann/html-to-markdown
[2] https://html-to-markdown.com/edge-cases
-
A note from our sponsor - SaaSHub
www.saashub.com | 30 Apr 2025
Stats
JohannesKaufmann/html-to-markdown is an open source project licensed under the MIT License, which is an OSI-approved license.
The primary programming language of html-to-markdown is Go.