HTML rag

Open-source HTML projects categorized as rag

Top 8 HTML rag Projects

  1. awesome-ai-web-search
  2. html-to-markdown

    High-performance, CommonMark-compliant HTML-to-Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR. (by kreuzberg-dev)

    Project mention: HTML to Markdown Converter | news.ycombinator.com | 2025-11-10
  3. confabulations

    Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.

    Project mention: I don't know how you get here from "predict the next word." | news.ycombinator.com | 2026-02-26

    Well here's some:

    Confabulation/Hallucination - https://github.com/lechmazur/confabulations

    Failure to read context - https://georggrab.net/content/opus46retrieval.html

    Deleting tests to make them pass - https://www.linkedin.com/posts/jasongorman_and-after-it-did-...

    Going rogue and deleting data - https://x.com/jasonlk/status/1946069562723897802

    Agent security nightmares because they are not in fact intelligent assistants - https://x.com/theonejvo/status/2015401219746128322

    Failure to read or generate structured data - https://support.google.com/gemini/thread/390981629/llm-ignor...

    There are many, many examples, mostly caused by people thinking LLMs are intelligent and reasoning and giving them too much power (e.g. treating them as agents, not text generators). I'm sure they're all fixed in whatever new version came out this week though.

  4. remembra

    Universal memory layer for AI applications. Self-host in minutes. Open source.

    Project mention: Remembra – Open-source semantic memory for AI agents | news.ycombinator.com | 2026-03-11
  5. stdm

    Self Thinking Data Manifest (by csiro)

  6. pocket-flow-framework

    LLM Framework for LLMs (by helenaeverleyz)

  7. Nexus

    NEXUS is an open-source, multi-persona AI agent orchestration platform & delegation framework. Integrates a Chief of Staff with specialized agents (Travel, Research, Legal, Finance, Vision, etc) directly into Web, Slack, & OpenClaw. Built on Bun, Hono, React, and TypeScript. (by Poi5eN)

    Project mention: How to Build a Real-Time Slack Agent Using Bun, Hono, and Event-Driven Orchestration | dev.to | 2026-06-10

    ⭐ GitHub: https://github.com/Poi5eN/Nexus 🎯 Live Demo: https://saarlabs.in

  8. pkc-mark-benchmark

    A local AI benchmark tool for testing LLM, Diffusers, and Transformers models.

    Project mention: Show HN: PKC Mark – open-source local benchmark for LLMs and Diffusers | news.ycombinator.com | 2025-12-01

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source rag projects in HTML? This list will help you:

#  Project                 Stars
1  awesome-ai-web-search   1,339
2  html-to-markdown          765
3  confabulations            247
4  remembra                   14
5  stdm                       10
6  pocket-flow-framework       4
7  Nexus                       2
8  pkc-mark-benchmark          2

Did you know that HTML is the 9th most popular programming language based on number of references?