rag

Top 23 rag Open-Source Projects

  1. dify

    Production-ready platform for agentic workflow development.

    Project mention: Bringing MongoDB Atlas and Voyage AI to Dify: Build RAG Workflows and Data Agents Without Heavy Glue Code | dev.to | 2026-05-31

    The MongoDB extensions for Dify help close that gap.

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. open-webui

    User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

    Project mention: Quick and easy local AI RAG setup with JetBrains IDE integration and browser UI | dev.to | 2026-06-02

    To get a local web UI (that is very similar to CharGPT) that supports Retrieval Augmented Generation (RAG), workflows and many other features, we'll use Open WebUI (https://github.com/open-webui/open-webui). Although it can be setup using locally installed Python, I've decided to try out their Docker image instead. Since I have an Nvidia card, I've used their Nvidia GPU supported docker image.

  4. awesome-llm-apps

    100+ AI Agent & RAG apps you can actually run — clone, customize, ship.

    Project mention: Awesome LLM Apps: Agents & RAG Showcase | dev.to | 2026-03-30

    Explore the repository and contribute to the future of AI: https://github.com/Shubhamsaboo/awesome-llm-apps

  5. ragflow

    RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

    Project mention: I Scanned 5 Popular Open-Source AI Projects for EU AI Act Compliance. Here's What I Found. | dev.to | 2026-03-31

    I ran AIR Blackbox (the scanner itself), Browser Use (79K+ stars), RAGFlow (76K+ stars), LiteLLM (23K+ stars), and Superlinked (15K+ stars) through the same compliance checks.

  6. PaddleOCR

    Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

    Project mention: NousResearch Agent, Open-Source Notebook LM, & Local Multimodal OCR for Consumer GPUs | dev.to | 2026-06-04
  7. LobeHub

    The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you.

    Project mention: Show HN: AI Roundtable – Let 200 models debate your question | news.ycombinator.com | 2026-03-24

    You can set this up yourself with API keys to the corresponding providers and creating an Agent Group in https://github.com/lobehub/lobehub. Agent groups allow you to easily create a room of agents and have them discuss any of your topics. Easily make agents with types and skills, it even assists in drafting starting prompts and even team members depending what your query (and selected model) is.

    You can self-host as well, but not via desktop app. Sever setup required.

    Be careful of your token context, you can easily rack up costs if you leave Opus selected as the model and get lost in some rabbit hole of results.

    Enjoy enjoy!

  8. Prompt-Engineering-Guide

    🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

    Project mention: Your AI is not bad, your instructions are | dev.to | 2026-05-22

    Prompt Engineering Guide

  9. anything-llm

    Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience

    Project mention: NVIDIA RTX Spark: What the Backlash Gets Wrong About AI on Your Desktop [2026] | dev.to | 2026-06-04

    The headline marketing number is "1 petaflop" of AI performance. Sounds staggering. Tim Carambat, creator of AnythingLLM and one of the most credible voices in the local AI developer community, has already questioned this figure. His point is one I've validated repeatedly in my own benchmarking: for running large language models locally, memory bandwidth is the actual bottleneck, not raw FLOPS. You can have all the tensor cores in the world, but if you can't feed them data fast enough, your Llama 3 inference is still going to crawl.

  10. llm-app

    Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

  11. mem0

    Universal memory layer for AI Agents

    Project mention: mem0 alternatives - MemClaw and Statewave | libhunt.com/r/mem0 | 2026-06-03
  12. Flowise

    Build AI Agents, Visually

    Project mention: I Tested Flowise, Dify, and n8n Across 30+ Client Deployments. Here Is My Verdict. | dev.to | 2026-04-07

    Citation Capsule: n8n's GitHub community reached 182,000+ stars across a 7-year development history, with 70+ AI-specific nodes added in 2024 to 2025. Source: n8n GitHub. Dify crossed 106,000 stars on GitHub with an Apache 2.0 license. Source: Dify GitHub. Flowise reached 51,000+ stars with MIT license. Source: Flowise GitHub. Dify's minimum recommended RAM is 4 GB versus Flowise's 1 GB and n8n's 300 MB. Source: Dify Docs.

  13. llama_index

    LlamaIndex is the leading document agent and OCR platform

    Project mention: Anthropic's 10 Finance Agents: A Buyer's Guide for Banks | dev.to | 2026-05-05

    Open-source alternative: The DIY pattern is a LlamaIndex RAG pipeline over your CRM + filings + news. Real, but takes a quarter to ship and 18 months to make trust-grade.

  14. JeecgBoot

    AI 低代码平台「低代码 + 零代码」双驱动!低代码可一键生成前后端代码;零代码可 5 分钟搭建系统;AI Skills 一句话画流程、设计表单、生成整套系统。内置 AI聊天、知识库、流程编排、MCP插件等,兼容主流大模型。引领「AI 生成 → 在线配置 → 代码生成 → 手工合并->AI修改」开发模式,消除 Java 项目 80% 的重复工作,提效而不失灵活。

  15. Milvus

    Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

    Project mention: The AI stack every developer will depend on in 2026 | dev.to | 2026-05-19

    Milvus: Optimized for large-scale, distributed memory operations

  16. MindsDB

    Platform dedicated to building an open foundation for applied Artificial Intelligence, designed for people seeking production-ready AI systems they can truly control, extend and deploy anywhere.

    Project mention: MindsDB Supercharges Google's MCP Toolbox with Unstructured Data Support | dev.to | 2025-12-29

    We’re happy to announce that we’ve integrated MindsDB with Google's open-source project, MCP (Model Context Protocol) Toolbox. This will make your AI applications very, very smart. This enhancement expands the Toolbox's reach, especially for organizations grappling with lots of siloed data.

  17. quivr

    Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.

  18. LightRAG

    [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

    Project mention: Show HN: Query years of Ask HN and Show HN discussions as local knowledge graph | news.ycombinator.com | 2026-05-10

    I built lightrag-snkv, Basically it uses lightRAG https://github.com/HKUDS/LightRAG ,this requires various storage databases like key value store, graph database, vector database, I built single embedded file based database which covers all these requirements: https://github.com/hash-anu/snkv.

  19. Vane

    Vane is an AI-powered answering engine.

    Project mention: AI Infrastructure on Consumer Hardware | dev.to | 2025-11-21

    Perplexica - Self-hosted AI search

  20. khoj

    Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

    Project mention: 25 Trending Self-Hosted Projects on GitHub | dev.to | 2026-04-02
  21. langgraph

    Build resilient agents.

    Project mention: Agent-Ready Engineering Infrastructure | dev.to | 2026-05-05

    LangGraph AGENTS.md

  22. graphrag

    A modular graph-based Retrieval-Augmented Generation (RAG) system

    Project mention: Graph RAG Isn't a One-Shot Anymore — The Case for Agentic Graph RAG MCPs | dev.to | 2026-05-07

    The most well-known implementation right now is Microsoft's GraphRAG, released in 2024. The papers are well-written and I have a lot of respect for it. But the design philosophy is squarely from the one-shot retrieval era.

  23. PageIndex

    📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

    Project mention: AI Builder Notes - May 2026 | dev.to | 2026-06-01

    Birdclaw is interesting because it gives agents access to a Twitter archive. [17] GBrain points at a personal recall layer around OpenClaw / Hermes-style workflows. [18] PageIndex is a useful reminder that simple retrieval, even BM25-only retrieval, still has a place. [19] The “RAG comeback in about 8 months” take lands because the archive problem is still unsolved in practice. [20]

  24. onyx

    Open Source AI Platform - AI Chat with advanced features that works with every LLM

    Project mention: Building an AI Context Layer for Engineering Teams | dev.to | 2026-04-05

    Onyx (formerly Danswer) is open source, self-hosted, and has pre-built connectors for Confluence, Jira, GitHub, Slack, and Google Drive. You deploy it with Docker, point it at your Atlassian and GitHub credentials, and it handles crawling, chunking, embedding, and incremental sync.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

rag discussion

Log in or Post with

rag related posts

  • mem0 alternatives - MemClaw and Statewave

    3 projects | 3 Jun 2026
  • NVIDIA RTX Spark: What the Backlash Gets Wrong About AI on Your Desktop [2026]

    1 project | dev.to | 4 Jun 2026
  • Open-source AI toolkit for e-commerce

    1 project | news.ycombinator.com | 4 Jun 2026
  • OpenAI models on Bedrock make AI deployment less messy

    1 project | dev.to | 2 Jun 2026
  • What I Learned Building a Local RAG Agent

    1 project | dev.to | 1 Jun 2026
  • I Tested 33 AI Memory Engines — Here's What Actually Works

    6 projects | dev.to | 28 May 2026
  • heym alternatives - n8n and sim

    3 projects | 15 May 2026
  • A note from our sponsor - SaaSHub
    www.saashub.com | 7 Jun 2026
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source rag projects? This list will help you:

# Project Stars
1 dify 143,689
2 open-webui 139,852
3 awesome-llm-apps 113,059
4 ragflow 81,919
5 PaddleOCR 79,706
6 LobeHub 78,186
7 Prompt-Engineering-Guide 75,276
8 anything-llm 61,083
9 llm-app 59,431
10 mem0 57,631
11 Flowise 53,317
12 llama_index 49,924
13 JeecgBoot 46,605
14 Milvus 44,649
15 MindsDB 39,243
16 quivr 39,171
17 LightRAG 36,193
18 Vane 35,178
19 khoj 34,892
20 langgraph 33,889
21 graphrag 33,458
22 PageIndex 32,583
23 onyx 30,038

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you know that Python is
the 1st most popular programming language
based on number of references?