Python retrieval-augmented-generation

Open-source Python projects categorized as retrieval-augmented-generation

Top 23 Python retrieval-augmented-generation Projects

retrieval-augmented-generation
  1. storm

    An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

    Project mention: Code Explanation: "STORM: Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking" | dev.to | 2025-03-08

    Note: this explanation only covers the knowledge_storm in the storm repo because it aligns with my interests.

  2. Judoscale

    Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.

    Judoscale logo
  3. haystack

    AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

    Project mention: Show HN: A Medical Research Agent Built with BioMCP and Haystack | news.ycombinator.com | 2025-04-21

    I created a simple app to explore how Agents & MCP can help with medical research. It connects to ClinicalTrials.gov, PubMed/PubTator, and MyVariant.info using the BioMCP Server(https://github.com/genomoncology/biomcp), and uses Haystack(https://github.com/deepset-ai/haystack) as the MCP Client.

    The idea is to let users ask natural-language questions like:

  4. LightRAG

    "LightRAG: Simple and Fast Retrieval-Augmented Generation"

    Project mention: Making Sense of Congressional Data with LightRAG, Amazon Bedrock, and Ollama | dev.to | 2025-03-22

    LightRAG enhances RAG systems by integrating graph structures into text indexing and retrieval processes. In simple terms, it better connects related pieces of information, giving more accurate and quick answers. By combining graph relationships with vector-based retrieval, LightRAG pulls in context from both low-level details and high-level insights. An incremental update algorithm ensures your data stays fresh, making it a great choice when data is continuously evolving.

  5. llmware

    Unified framework for building enterprise RAG pipelines with small, specialized models

    Project mention: Llama.cpp guide – Running LLMs locally on any hardware, from scratch | news.ycombinator.com | 2024-11-29
  6. txtai

    πŸ’‘ All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

    Project mention: Chunking your data for RAG | dev.to | 2025-02-11
  7. FlagEmbedding

    Retrieval and Retrieval-augmented LLMs

    Project mention: Understanding RAG (Part 5): Recommendations and wrap-up | dev.to | 2024-09-09

    Choosing the right embedding model is equally important for effective semantic matching of queries and chunk blocks. To select the appropriate open-source embedding model, the authors conducted another experiment using the evaluation module of FlagEmbedding, which uses the dataset namespace-Pt/msmarco7 for queries and the dataset namespace-Pt/msmarco-corpus8 for the corpus and metrics like RR and MRR were used for evaluation.

  8. R2R

    SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.

    Project mention: Lists of open-source frameworks for building RAG applications | dev.to | 2025-01-02

    Ideal For: Applications requiring dynamic data handling and complex relationships between entities. GitHub Repository

  9. InfluxDB

    InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.

    InfluxDB logo
  10. TaskingAI

    The open source platform for AI-native application development.

  11. cognita

    RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

    Project mention: Lists of open-source frameworks for building RAG applications | dev.to | 2025-01-02

    Ideal For: Enterprises seeking a robust framework for large-scale AI applications. GitHub Repository

  12. AutoRAG

    AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

    Project mention: AIM Weekly 28 Oct 2024 | dev.to | 2024-10-28

    πŸ“Ž AutoRAG with Milvus πŸ› οΈ ADO 🫢 Self Hosting LLM 🌐 Noema Declarative AI πŸ“ New NIM Blueprint for building AI Virtual Assistant πŸš™ Zilliz Integrations 🫢 Using Milvus for Semantic Search πŸ€– Contextual Retrieval πŸ“Ž Meta: Quantized Light Weight Models πŸš™ https://arxiv.org/pdf/2407.01219 βœ… Cool Icons πŸ™Œ IBM Watson AI Milvus Bot πŸ“Ž The Hacker's Browser πŸ› οΈ Small and Mighty H2O Model πŸ“ Zilliz Cloud vs Qdrant πŸ’« Gravatino and Agents πŸ› οΈ OSS Summit Europe 2024 Report ▢️ RAG Strategi πŸ€– MS AI Data Visualizations 🌐 Graph RAG πŸ‘½ South Bay Meetup 15 Oct 2024 🦾 Influx and Milvus πŸ‘½ Multimodal Pipelines ✨ Constrained Sampling from LLM πŸš• BAML: Cheaper, Fast and More Accurate Function Calling πŸ“Š Infinite World Generation with outlines txt πŸ’» Ollama Client Swift πŸ” Atomic Agents πŸ•ΆοΈ PYMUPDF4LLM πŸš• Milvus for AI Agents πŸ“Š Fine Tuning LLAMA 3 with ORPO 🦾 Run NVIDIA Models πŸ’» LLM Training Meta Lingua ✨ 1 Bit LLM - MS BitNet πŸ’» Intro πŸ•ΆοΈ Mastering Chunk πŸ“Š Storm Stanford Tool 🐍 DAMO NLP SG CaRing πŸ” LLM Reasoners

  13. langroid

    Harness LLMs with Multi-Agent Programming

    Project mention: Understanding the BM25 full text search algorithm | news.ycombinator.com | 2024-11-19

    In the Langroid[1] LLM library we have a clean, extensible RAG implementation in the DocChatAgent[2] -- it uses several retrieval techniques, including lexical (bm25, fuzzy search) and semantic (embeddings), and re-ranking (using cross-encoder, reciprocal-rank-fusion) and also re-ranking for diversity and lost-in-the-middle mitigation:

    [1] Langroid - a multi-agent LLM framework from CMU/UW-Madison researchers https://github.com/langroid/langroid

    [2] DocChatAgent Implementation -

  14. GenerativeAIExamples

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Project mention: RedLM: My submission for the NVIDIA and LlamaIndex Developer Contest | dev.to | 2024-11-14

    There are a LOT of options to consider when picking a vector database for a RAG application. Milvus has a highly decoupled architecture, it is fully open source and I had seen it in some examples in the NVIDIA/GenerativeAIExamples repo, so I decided to give it a try.

  15. Agent-S

    Agent S: an open agentic framework that uses computers like a human

    Project mention: AIM Weekly for 04Nov2024 | dev.to | 2024-11-04

    🌐 Composed Image Retrieval πŸ“Ž Intro to Multimodal LLama 3.2 πŸ› οΈ Multi Agent Concierge πŸ’» RAG with Langchain Granite, Milvus 🫢 Download content βœ… Transformer Replacement? πŸ€– vLLM for runing models 🌐 Amphion πŸ“ Autogluon πŸš™ Notebook LLama like Google's Notebook LLM 🫢 Monocle2ai for tracing GenAI app code LFA&D Project πŸ€– Bee Agent Framework βœ… LLama RFP Response ▢️ GenAI Script πŸ‘½ Simular AI Agent S 🦾 DrawDB with AI ✨ Ollama with LLama 3.2 Vision!!!! Preview πŸš• Powerful RAG Checker πŸ“Š SQL Generator πŸ’» Role of LLMs 🐍 Document Extraction πŸ•ΆοΈ Open Source Vector DB Reddit πŸ” The Practical Guide to Self Hosting LLM 🦾 Stagehand Controller πŸ•ΆοΈ Understanding HNSWLIB 🐍 Best practices in RAG πŸ’» Enigma Agent πŸ“ Langchain, Ollama, Phi3 for Function Calling πŸ”‹ Compass Judger πŸ“ Princeton NLP SimPO πŸ” Princeton NLP ProLong πŸ”‹ Princeton NLP HELMET 🧐 Ollama Cheatsheet πŸš• Princeton NLP CopyCat πŸ“Š Princeton NLP Shp πŸ•ΆοΈ Can LLM Solve Hard Github Issues πŸ“ Enabling Large Language Models to Generate Text with Citations πŸ”‹ Princeton NLP CharXiv πŸ“Š Awesome AI Agents List 🦾 Nomic’s Matryoshka text embedding model

  16. fastembed

    Fast, Accurate, Lightweight Python library to make State of the Art Embedding

  17. colpali

    The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

    Project mention: Integrating Vision-Language Models into Agentic RAG Systems with ColPali | dev.to | 2025-03-31

    If you want to learn more about ColPali, you can refer to the official documentation and also I would recommend you to read the 9 part blog series on RAG on DailyDoseofDS by Avi Chawla and Akshay Pachaar.

  18. raptor

    The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

    Project mention: Graph RAG의 λͺ¨λ“  것 | dev.to | 2025-04-20

    3.2. RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval (Stanford Univ, 2024)

  19. raglite

    πŸ₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite

    Project mention: Show HN: RAGLite – A Python package for the unhobbling of RAG | news.ycombinator.com | 2024-12-19
  20. rag-demystified

    An LLM-powered advanced RAG pipeline built from scratch

  21. AnglE

    Train and Infer Powerful Sentence Embeddings with AnglE | πŸ”₯ SOTA on STS and MTEB Leaderboard (by SeanLee97)

  22. obsidian-copilot

    πŸ€– A prototype assistant for writing and thinking (by eugeneyan)

  23. continuous-eval

    Data-Driven Evaluation for LLM-Powered Applications

    Project mention: Show HN: Ellipsis – Automated PR reviews and bug fixes | news.ycombinator.com | 2024-05-09
  24. xmc.dspy

    In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.

    Project mention: Betting on DSPy for Systems of LLMs | news.ycombinator.com | 2024-08-10

    The abstractions could be cleaner. I think some of the convolution is due to the evolution that it has undergone and core contributors have not come around to being fully β€œout with the old”.

    I think there might be practical benefits to it. The XMC example illustrates it for me:

    https://github.com/KarelDO/xmc.dspy

  25. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python retrieval-augmented-generation discussion

Log in or Post with

Python retrieval-augmented-generation related posts

  • Graph RAG의 λͺ¨λ“  것

    4 projects | dev.to | 20 Apr 2025
  • Integrating Vision-Language Models into Agentic RAG Systems with ColPali

    2 projects | dev.to | 31 Mar 2025
  • Getting started with LLM APIs

    3 projects | dev.to | 2 Jan 2025
  • Lists of open-source frameworks for building RAG applications

    7 projects | dev.to | 2 Jan 2025
  • RAPTOR: A Novel Tree-Based Retrieval System for Enhancing Language Models – Research Summary

    1 project | dev.to | 13 Dec 2024
  • LightRAG: Simple and Fast Retrieval-Augmented Generation

    1 project | news.ycombinator.com | 2 Dec 2024
  • Understanding the BM25 full text search algorithm

    5 projects | news.ycombinator.com | 19 Nov 2024
  • A note from our sponsor - CodeRabbit
    coderabbit.ai | 27 Apr 2025
    Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR. Learn more β†’

Index

What are some of the best open-source retrieval-augmented-generation projects in Python? This list will help you:

# Project Stars
1 storm 23,997
2 haystack 20,427
3 LightRAG 15,523
4 llmware 13,091
5 txtai 10,798
6 FlagEmbedding 9,440
7 R2R 6,501
8 TaskingAI 5,103
9 cognita 4,011
10 AutoRAG 3,856
11 langroid 3,234
12 GenerativeAIExamples 3,006
13 swirl-search 2,745
14 Agent-S 2,403
15 fastembed 1,995
16 colpali 1,769
17 raptor 1,191
18 raglite 924
19 rag-demystified 832
20 AnglE 534
21 obsidian-copilot 532
22 continuous-eval 489
23 xmc.dspy 403

Sponsored
Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?