SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 rag Open-Source Projects
-
Project mention: Bringing MongoDB Atlas and Voyage AI to Dify: Build RAG Workflows and Data Agents Without Heavy Glue Code | dev.to | 2026-05-31
The MongoDB extensions for Dify help close that gap.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Project mention: Quick and easy local AI RAG setup with JetBrains IDE integration and browser UI | dev.to | 2026-06-02
To get a local web UI (that is very similar to CharGPT) that supports Retrieval Augmented Generation (RAG), workflows and many other features, we'll use Open WebUI (https://github.com/open-webui/open-webui). Although it can be setup using locally installed Python, I've decided to try out their Docker image instead. Since I have an Nvidia card, I've used their Nvidia GPU supported docker image.
-
Explore the repository and contribute to the future of AI: https://github.com/Shubhamsaboo/awesome-llm-apps
-
ragflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Project mention: I Scanned 5 Popular Open-Source AI Projects for EU AI Act Compliance. Here's What I Found. | dev.to | 2026-03-31I ran AIR Blackbox (the scanner itself), Browser Use (79K+ stars), RAGFlow (76K+ stars), LiteLLM (23K+ stars), and Superlinked (15K+ stars) through the same compliance checks.
-
PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Project mention: NousResearch Agent, Open-Source Notebook LM, & Local Multimodal OCR for Consumer GPUs | dev.to | 2026-06-04 -
LobeHub
The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you.
Project mention: Show HN: AI Roundtable – Let 200 models debate your question | news.ycombinator.com | 2026-03-24You can set this up yourself with API keys to the corresponding providers and creating an Agent Group in https://github.com/lobehub/lobehub. Agent groups allow you to easily create a room of agents and have them discuss any of your topics. Easily make agents with types and skills, it even assists in drafting starting prompts and even team members depending what your query (and selected model) is.
You can self-host as well, but not via desktop app. Sever setup required.
Be careful of your token context, you can easily rack up costs if you leave Opus selected as the model and get lost in some rabbit hole of results.
Enjoy enjoy!
-
Prompt-Engineering-Guide
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
Prompt Engineering Guide
-
anything-llm
Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience
Project mention: NVIDIA RTX Spark: What the Backlash Gets Wrong About AI on Your Desktop [2026] | dev.to | 2026-06-04The headline marketing number is "1 petaflop" of AI performance. Sounds staggering. Tim Carambat, creator of AnythingLLM and one of the most credible voices in the local AI developer community, has already questioned this figure. His point is one I've validated repeatedly in my own benchmarking: for running large language models locally, memory bandwidth is the actual bottleneck, not raw FLOPS. You can have all the tensor cores in the world, but if you can't feed them data fast enough, your Llama 3 inference is still going to crawl.
-
llm-app
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
-
-
Project mention: I Tested Flowise, Dify, and n8n Across 30+ Client Deployments. Here Is My Verdict. | dev.to | 2026-04-07
Citation Capsule: n8n's GitHub community reached 182,000+ stars across a 7-year development history, with 70+ AI-specific nodes added in 2024 to 2025. Source: n8n GitHub. Dify crossed 106,000 stars on GitHub with an Apache 2.0 license. Source: Dify GitHub. Flowise reached 51,000+ stars with MIT license. Source: Flowise GitHub. Dify's minimum recommended RAM is 4 GB versus Flowise's 1 GB and n8n's 300 MB. Source: Dify Docs.
-
Open-source alternative: The DIY pattern is a LlamaIndex RAG pipeline over your CRM + filings + news. Real, but takes a quarter to ship and 18 months to make trust-grade.
-
JeecgBoot
AI 低代码平台「低代码 + 零代码」双驱动!低代码可一键生成前后端代码;零代码可 5 分钟搭建系统;AI Skills 一句话画流程、设计表单、生成整套系统。内置 AI聊天、知识库、流程编排、MCP插件等,兼容主流大模型。引领「AI 生成 → 在线配置 → 代码生成 → 手工合并->AI修改」开发模式,消除 Java 项目 80% 的重复工作,提效而不失灵活。
-
Milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Milvus: Optimized for large-scale, distributed memory operations
-
MindsDB
Platform dedicated to building an open foundation for applied Artificial Intelligence, designed for people seeking production-ready AI systems they can truly control, extend and deploy anywhere.
Project mention: MindsDB Supercharges Google's MCP Toolbox with Unstructured Data Support | dev.to | 2025-12-29We’re happy to announce that we’ve integrated MindsDB with Google's open-source project, MCP (Model Context Protocol) Toolbox. This will make your AI applications very, very smart. This enhancement expands the Toolbox's reach, especially for organizations grappling with lots of siloed data.
-
quivr
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.
-
Project mention: Show HN: Query years of Ask HN and Show HN discussions as local knowledge graph | news.ycombinator.com | 2026-05-10
I built lightrag-snkv, Basically it uses lightRAG https://github.com/HKUDS/LightRAG ,this requires various storage databases like key value store, graph database, vector database, I built single embedded file based database which covers all these requirements: https://github.com/hash-anu/snkv.
-
Perplexica - Self-hosted AI search
-
khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
-
LangGraph AGENTS.md
-
Project mention: Graph RAG Isn't a One-Shot Anymore — The Case for Agentic Graph RAG MCPs | dev.to | 2026-05-07
The most well-known implementation right now is Microsoft's GraphRAG, released in 2024. The papers are well-written and I have a lot of respect for it. But the design philosophy is squarely from the one-shot retrieval era.
-
Birdclaw is interesting because it gives agents access to a Twitter archive. [17] GBrain points at a personal recall layer around OpenClaw / Hermes-style workflows. [18] PageIndex is a useful reminder that simple retrieval, even BM25-only retrieval, still has a place. [19] The “RAG comeback in about 8 months” take lands because the archive problem is still unsolved in practice. [20]
-
Onyx (formerly Danswer) is open source, self-hosted, and has pre-built connectors for Confluence, Jira, GitHub, Slack, and Google Drive. You deploy it with Docker, point it at your Atlassian and GitHub credentials, and it handles crawling, chunking, embedding, and incremental sync.
rag discussion
rag related posts
-
mem0 alternatives - MemClaw and Statewave
3 projects | 3 Jun 2026 -
NVIDIA RTX Spark: What the Backlash Gets Wrong About AI on Your Desktop [2026]
-
Open-source AI toolkit for e-commerce
-
OpenAI models on Bedrock make AI deployment less messy
-
What I Learned Building a Local RAG Agent
-
I Tested 33 AI Memory Engines — Here's What Actually Works
-
heym alternatives - n8n and sim
3 projects | 15 May 2026 -
A note from our sponsor - SaaSHub
www.saashub.com | 7 Jun 2026
Index
What are some of the best open-source rag projects? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | dify | 143,689 |
| 2 | open-webui | 139,852 |
| 3 | awesome-llm-apps | 113,059 |
| 4 | ragflow | 81,919 |
| 5 | PaddleOCR | 79,706 |
| 6 | LobeHub | 78,186 |
| 7 | Prompt-Engineering-Guide | 75,276 |
| 8 | anything-llm | 61,083 |
| 9 | llm-app | 59,431 |
| 10 | mem0 | 57,631 |
| 11 | Flowise | 53,317 |
| 12 | llama_index | 49,924 |
| 13 | JeecgBoot | 46,605 |
| 14 | Milvus | 44,649 |
| 15 | MindsDB | 39,243 |
| 16 | quivr | 39,171 |
| 17 | LightRAG | 36,193 |
| 18 | Vane | 35,178 |
| 19 | khoj | 34,892 |
| 20 | langgraph | 33,889 |
| 21 | graphrag | 33,458 |
| 22 | PageIndex | 32,583 |
| 23 | onyx | 30,038 |