Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR. Learn more β
Top 23 Python retrieval-augmented-generation Projects
-
storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Project mention: Code Explanation: "STORM: Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking" | dev.to | 2025-03-08Note: this explanation only covers the knowledge_storm in the storm repo because it aligns with my interests.
-
Judoscale
Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
-
haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Project mention: Show HN: A Medical Research Agent Built with BioMCP and Haystack | news.ycombinator.com | 2025-04-21I created a simple app to explore how Agents & MCP can help with medical research. It connects to ClinicalTrials.gov, PubMed/PubTator, and MyVariant.info using the BioMCP Server(https://github.com/genomoncology/biomcp), and uses Haystack(https://github.com/deepset-ai/haystack) as the MCP Client.
The idea is to let users ask natural-language questions like:
-
Project mention: Making Sense of Congressional Data with LightRAG, Amazon Bedrock, and Ollama | dev.to | 2025-03-22
LightRAG enhances RAG systems by integrating graph structures into text indexing and retrieval processes. In simple terms, it better connects related pieces of information, giving more accurate and quick answers. By combining graph relationships with vector-based retrieval, LightRAG pulls in context from both low-level details and high-level insights. An incremental update algorithm ensures your data stays fresh, making it a great choice when data is continuously evolving.
-
Project mention: Llama.cpp guide β Running LLMs locally on any hardware, from scratch | news.ycombinator.com | 2024-11-29
-
txtai
π‘ All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
-
Choosing the right embedding model is equally important for effective semantic matching of queries and chunk blocks. To select the appropriate open-source embedding model, the authors conducted another experiment using the evaluation module of FlagEmbedding, which uses the dataset namespace-Pt/msmarco7 for queries and the dataset namespace-Pt/msmarco-corpus8 for the corpus and metrics like RR and MRR were used for evaluation.
-
R2R
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
Project mention: Lists of open-source frameworks for building RAG applications | dev.to | 2025-01-02Ideal For: Applications requiring dynamic data handling and complex relationships between entities. GitHub Repository
-
InfluxDB
InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
-
-
cognita
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
Project mention: Lists of open-source frameworks for building RAG applications | dev.to | 2025-01-02Ideal For: Enterprises seeking a robust framework for large-scale AI applications. GitHub Repository
-
AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
π AutoRAG with Milvus π οΈ ADO π«Ά Self Hosting LLM π Noema Declarative AI π New NIM Blueprint for building AI Virtual Assistant π Zilliz Integrations π«Ά Using Milvus for Semantic Search π€ Contextual Retrieval π Meta: Quantized Light Weight Models π https://arxiv.org/pdf/2407.01219 β Cool Icons π IBM Watson AI Milvus Bot π The Hacker's Browser π οΈ Small and Mighty H2O Model π Zilliz Cloud vs Qdrant π« Gravatino and Agents π οΈ OSS Summit Europe 2024 Report βΆοΈ RAG Strategi π€ MS AI Data Visualizations π Graph RAG π½ South Bay Meetup 15 Oct 2024 π¦Ύ Influx and Milvus π½ Multimodal Pipelines β¨ Constrained Sampling from LLM π BAML: Cheaper, Fast and More Accurate Function Calling π Infinite World Generation with outlines txt π» Ollama Client Swift π Atomic Agents πΆοΈ PYMUPDF4LLM π Milvus for AI Agents π Fine Tuning LLAMA 3 with ORPO π¦Ύ Run NVIDIA Models π» LLM Training Meta Lingua β¨ 1 Bit LLM - MS BitNet π» Intro πΆοΈ Mastering Chunk π Storm Stanford Tool π DAMO NLP SG CaRing π LLM Reasoners
-
Project mention: Understanding the BM25 full text search algorithm | news.ycombinator.com | 2024-11-19
In the Langroid[1] LLM library we have a clean, extensible RAG implementation in the DocChatAgent[2] -- it uses several retrieval techniques, including lexical (bm25, fuzzy search) and semantic (embeddings), and re-ranking (using cross-encoder, reciprocal-rank-fusion) and also re-ranking for diversity and lost-in-the-middle mitigation:
[1] Langroid - a multi-agent LLM framework from CMU/UW-Madison researchers https://github.com/langroid/langroid
[2] DocChatAgent Implementation -
-
GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Project mention: RedLM: My submission for the NVIDIA and LlamaIndex Developer Contest | dev.to | 2024-11-14There are a LOT of options to consider when picking a vector database for a RAG application. Milvus has a highly decoupled architecture, it is fully open source and I had seen it in some examples in the NVIDIA/GenerativeAIExamples repo, so I decided to give it a try.
-
swirl-search
Swirl is an open-source search platform that uses AI to search multiple content and data sources simultaneously and return AI-ranked results. And provides summaries of your answers from searches using LLMs. It's a one-click, easy-to-use Retrieval Augmented Generation (RAG) Solution.
Project mention: How These Free Open Source Projects Can Jumpstart Your Career (No Experience? No Problem!) | dev.to | 2024-12-13Give SWIRL a try: https://github.com/swirlai/swirl-search
-
π Composed Image Retrieval π Intro to Multimodal LLama 3.2 π οΈ Multi Agent Concierge π» RAG with Langchain Granite, Milvus π«Ά Download content β Transformer Replacement? π€ vLLM for runing models π Amphion π Autogluon π Notebook LLama like Google's Notebook LLM π«Ά Monocle2ai for tracing GenAI app code LFA&D Project π€ Bee Agent Framework β LLama RFP Response βΆοΈ GenAI Script π½ Simular AI Agent S π¦Ύ DrawDB with AI β¨ Ollama with LLama 3.2 Vision!!!! Preview π Powerful RAG Checker π SQL Generator π» Role of LLMs π Document Extraction πΆοΈ Open Source Vector DB Reddit π The Practical Guide to Self Hosting LLM π¦Ύ Stagehand Controller πΆοΈ Understanding HNSWLIB π Best practices in RAG π» Enigma Agent π Langchain, Ollama, Phi3 for Function Calling π Compass Judger π Princeton NLP SimPO π Princeton NLP ProLong π Princeton NLP HELMET π§ Ollama Cheatsheet π Princeton NLP CopyCat π Princeton NLP Shp πΆοΈ Can LLM Solve Hard Github Issues π Enabling Large Language Models to Generate Text with Citations π Princeton NLP CharXiv π Awesome AI Agents List π¦Ύ Nomicβs Matryoshka text embedding model
-
-
colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
Project mention: Integrating Vision-Language Models into Agentic RAG Systems with ColPali | dev.to | 2025-03-31If you want to learn more about ColPali, you can refer to the official documentation and also I would recommend you to read the 9 part blog series on RAG on DailyDoseofDS by Avi Chawla and Akshay Pachaar.
-
raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
3.2. RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval (Stanford Univ, 2024)
-
raglite
π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite
Project mention: Show HN: RAGLite β A Python package for the unhobbling of RAG | news.ycombinator.com | 2024-12-19 -
-
AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | π₯ SOTA on STS and MTEB Leaderboard (by SeanLee97)
-
-
Project mention: Show HN: Ellipsis β Automated PR reviews and bug fixes | news.ycombinator.com | 2024-05-09
-
xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
The abstractions could be cleaner. I think some of the convolution is due to the evolution that it has undergone and core contributors have not come around to being fully βout with the oldβ.
I think there might be practical benefits to it. The XMC example illustrates it for me:
https://github.com/KarelDO/xmc.dspy
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
Python retrieval-augmented-generation discussion
Python retrieval-augmented-generation related posts
-
Graph RAGμ λͺ¨λ κ²
-
Integrating Vision-Language Models into Agentic RAG Systems with ColPali
-
Getting started with LLM APIs
-
Lists of open-source frameworks for building RAG applications
-
RAPTOR: A Novel Tree-Based Retrieval System for Enhancing Language Models β Research Summary
-
LightRAG: Simple and Fast Retrieval-Augmented Generation
-
Understanding the BM25 full text search algorithm
-
A note from our sponsor - CodeRabbit
coderabbit.ai | 27 Apr 2025
Index
What are some of the best open-source retrieval-augmented-generation projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | storm | 23,997 |
2 | haystack | 20,427 |
3 | LightRAG | 15,523 |
4 | llmware | 13,091 |
5 | txtai | 10,798 |
6 | FlagEmbedding | 9,440 |
7 | R2R | 6,501 |
8 | TaskingAI | 5,103 |
9 | cognita | 4,011 |
10 | AutoRAG | 3,856 |
11 | langroid | 3,234 |
12 | GenerativeAIExamples | 3,006 |
13 | swirl-search | 2,745 |
14 | Agent-S | 2,403 |
15 | fastembed | 1,995 |
16 | colpali | 1,769 |
17 | raptor | 1,191 |
18 | raglite | 924 |
19 | rag-demystified | 832 |
20 | AnglE | 534 |
21 | obsidian-copilot | 532 |
22 | continuous-eval | 489 |
23 | xmc.dspy | 403 |