html-to-markdown

⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules. (by JohannesKaufmann)

Html-to-markdown Alternatives

Similar projects and alternatives to html-to-markdown

  1. servers

    79 html-to-markdown VS servers

    Model Context Protocol Servers

  2. readability

    A standalone version of the readability lib

  3. litellm

    Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

  4. browser-use

    Make websites accessible for AI agents

  5. to-markdown

    🛏 An HTML to Markdown converter written in JavaScript

  6. fastapi_mcp

    Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!

  7. toml

    TOML parser for Golang with reflection. (by BurntSushi)

  8. ODF

    Open Document Format (ODF) generator library for Go.

  9. micro-editor

    A modern and intuitive terminal-based text editor

  10. xpath

    XPath package for Golang, supports HTML, XML, JSON document query.

  11. posting

    The modern API client that lives in your terminal.

  12. amphi-etl

    5 html-to-markdown VS amphi-etl

    Visual Data Transformation and Data Preparation. Low-Code Python-based ETL.

  13. cat

    Extract text from plaintext, .docx, .odt and .rtf files. Pure go.

  14. flexmark-java

    CommonMark/Markdown Java parser with source level AST. CommonMark 0.28, emulation of: pegdown, kramdown, markdown.pl, MultiMarkdown. With HTML to MD, MD to PDF, MD to DOCX conversion modules.

  15. pagser

    Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawler

  16. go-vcard

    A Go library to parse and format vCard

  17. htmd

    A turndown.js inspired HTML to Markdown converter for Rust (by letmutex)

  18. did

    A golang package to work with Decentralized Identifiers (DIDs) (by build-trust)

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better html-to-markdown alternative or higher similarity.

html-to-markdown reviews and mentions

Posts with mentions or reviews of html-to-markdown. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-03-24.
  • Show HN: We made an MCP Server so that Cursor can build anything from API Docs
    5 projects | news.ycombinator.com | 24 Mar 2025
    I'm frequently constructing context based on up-to-date docs using curl + html2markdown[0] and custom css selectors, which is extremely tedious. MCP servers for docs would be very useful for me.

    That said, I don't really expect the AI itself to come up with docs to read (maybe some day). I want it predominantly so I can manually reference it in my prompt (in e.g. the Zed assistant panel) like `/npmdocs packagename packageversion`.

    But even for AI "self-driven" use-cases, I primarily see the value in read-only MCP servers that provide more context, just in an "as-needed" way, instead of me putting it there explicitly.

    [0]: https://github.com/JohannesKaufmann/html-to-markdown

  • AIM Weekly for 11/11/2024
    6 projects | dev.to | 11 Nov 2024
    🌐 All this buildings 📎 Synthetic Data Generator - local on Ollama! 💻 Agentic Mesh Future! 🦾 Convert an entire HTML Website to Markdown with Great Go CLI Tool 🫶 LiteLLM 📝 How to set up InstructLab Granite Model 🌐 Fundamentals of Platform Transformation Salesforce 🌐 GraphRAG Explained 🌐 Matryoshka Embedding Detail at Multiple Scales 🌐 Browser Use for LLM 📎 Creating Advanced AI Agents with Ollama and Langchain 💻 Great command line REST Client 🤖 Inkeep Builds an AI Assistant with Milvus ✅ Visual Data Transformation ETL 💻 What is a LongRAG 🛠️ PyMuPDF4LLM is Your New Best Friend 🫶 WebScraping for LLM
  • Show HN: HTML-to-Markdown – convert entire websites to Markdown with Golang/CLI
    6 projects | news.ycombinator.com | 9 Nov 2024
    Yeah good point, that's actually difficult. They use many `<span>` html tags to color individual words and syntax.

    But I wrote logic to handle that. It probably needs to be adapted at some point, but works surprisingly well. Have a look at the testdata files ("code.in.html" and "code.out.md" files [1]).

    Feel free to give it a try & let me know if you notice any edge cases!

    [1] https://github.com/JohannesKaufmann/html-to-markdown/blob/ma...

  • Htmd: A turndown.js inspired HTML-to-Markdown converter for Rust
    4 projects | news.ycombinator.com | 16 Jun 2024
    Note: Six years ago I open-sourced a Golang library [1]. I am currently rewriting it completely, with the aim of making it even better than Pandoc, and I wrote about the edge cases I encountered [2].

    [1] https://github.com/JohannesKaufmann/html-to-markdown

    [2] https://html-to-markdown.com/edge-cases

Stats

Basic html-to-markdown repo stats
Mentions: 5
Stars: 2,789
Activity: 8.4
Last commit: 9 days ago
