Python rag

Open-source Python projects categorized as rag

Top 23 Python rag Projects

  • llama_index

    LlamaIndex is a data framework for your LLM applications

    Project mention: Show HN: Route your prompts to the best LLM | news.ycombinator.com | 2024-05-22
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • quivr

    Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.

    Project mention: Ask HN: Local RAG with private knowledge base | news.ycombinator.com | 2024-10-29
  • chatgpt-on-wechat

    基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。

  • ragflow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

    Project mention: 7 AI Open Source Libraries To Build RAG, Agents & AI Search | dev.to | 2024-11-14

    ⭐️ RAG Flow on GitHub

  • mem0

    The Memory layer for your AI apps

    Project mention: Show HN: Claude Memory – Long-term memory for Claude | news.ycombinator.com | 2024-09-05
  • kotaemon

    An open-source RAG-based tool for chatting with your documents.

    Project mention: Show HN: Quilt (Chat with your documents using cutting-edge RAG) | news.ycombinator.com | 2024-10-07

    E__m_c_2 4 days ago | next [–]

    this is fork of https://github.com/Cinnamon/kotaemon Please attribute accordingly

  • khoj

    Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claude, gemini, llama, qwen, mistral).

    Project mention: Top 13 Self-Hosted Projects with the Most GitHub Stars | dev.to | 2024-09-10

    GitHub https://github.com/khoj-ai/khoj GitHub Star 12.4k GitHub Fork 627 GitHub Issue 64 GitHub Pull Request 3 GitHub Contributor 35 Open Source License AGPL-3.0 Official Website https://khoj.dev/ Documentation https://docs.khoj.dev/

  • DB-GPT

    AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

  • vanna

    🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

    Project mention: Supercharging Obsidian Search with AI and Ollama | dev.to | 2024-11-26

    Essentially this solution is to let the AI *formulate the search* expression and not do the search itself (similar to the concept of generating a SQL statement instead of executing it https://github.com/vanna-ai/vanna).

  • danswer

    Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.

    Project mention: Show HN: Danswer APIs – Open-source APIs for building RAG apps over company docs | news.ycombinator.com | 2024-11-12
  • LightRAG

    "LightRAG: Simple and Fast Retrieval-Augmented Generation"

    Project mention: LightRAG: Simple and Fast Retrieval-Augmented Generation | news.ycombinator.com | 2024-12-02
  • txtai

    💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

    Project mention: Postgres for Everything (E/Postgres) | news.ycombinator.com | 2024-12-06

    I fully agree. Postgres has solved many of the problems that many are re-solving with GenAI related databases.

    With txtai (https://github.com/neuml/txtai), I've went all in with Postgres + pgvector. Projects can start small with a SQLite backend then switch the persistence to Postgres. With this, you get all the years of battle-tested production experience from Postgres built-in for free.

  • rags

    Build ChatGPT over your data, all with natural language

    Project mention: SuperDuperDB - how to use it to talk to your documents locally using llama 7B or Mistral 7B? | /r/LocalLLaMA | 2023-12-09
  • TaskingAI

    The open source platform for AI-native application development.

    Project mention: TaskingAI: AI-native app development platform | news.ycombinator.com | 2024-01-30
  • superduper

    Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.

    Project mention: Build fully portable AI applications on top of Snowflake with SuperDuperDB | dev.to | 2024-06-26

    Customize how AI and databases work together. Scale your AI projects to handle more data and users. Move AI projects between different environments easily. Extend the system with new AI features and database functionality. Check it out: Blog: https://blog.superduperdb.com/version-02 Github: https://github.com/SuperDuperDB/superduperdb (leave us a star ⭐️🥳)

  • awesome-llm-apps

    Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.

    Project mention: Ask HN: Recommendation for a SWE looking to get up to speed with latest on AI | news.ycombinator.com | 2024-11-27

    Unwind AI would be helpful. They publish daily newsletters on AI as well as tutorials on building apps with step-by-step walkthrough. Super focused on developers. https://www.theunwindai.com/

  • dataherald

    Interact with your SQL database, Natural Language to SQL using LLMs

    Project mention: AIM Weekly 27 May 2024 | dev.to | 2024-05-28
  • cognita

    RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

    Project mention: 7 AI Open Source Libraries To Build RAG, Agents & AI Search | dev.to | 2024-11-14

    ⭐️ Cognita on GitHub

  • llama_parse

    Parse files for optimal RAG

    Project mention: Adding an Amazon Bedrock Knowledge Base to the Forex Rate Assistant | dev.to | 2024-05-16

    It's fair to think that undesirable artifacts and lack of structural context would impact search accuracy, performance, and ultimately cost. Consequently, it makes sense to perform some data pre-processing before passing the source documents to the RAG workflow. Third-party APIs and tools, such as LlamaParse and LayoutPDFReader, can help with pre-processing PDF data, however keep in mind that source documents may take any forms and there is no one-size-fits-all solution. You may have to resort to developing custom processes for pre-processing and search your unique data.

  • langroid

    Harness LLMs with Multi-Agent Programming

    Project mention: Understanding the BM25 full text search algorithm | news.ycombinator.com | 2024-11-19

    In the Langroid[1] LLM library we have a clean, extensible RAG implementation in the DocChatAgent[2] -- it uses several retrieval techniques, including lexical (bm25, fuzzy search) and semantic (embeddings), and re-ranking (using cross-encoder, reciprocal-rank-fusion) and also re-ranking for diversity and lost-in-the-middle mitigation:

    [1] Langroid - a multi-agent LLM framework from CMU/UW-Madison researchers https://github.com/langroid/langroid

    [2] DocChatAgent Implementation -

  • gptme

    Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.

    Project mention: Show HN: OpenSource]Personal AI assistant in your terminal | news.ycombinator.com | 2024-10-09
  • GenerativeAIExamples

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Project mention: RedLM: My submission for the NVIDIA and LlamaIndex Developer Contest | dev.to | 2024-11-14

    There are a LOT of options to consider when picking a vector database for a RAG application. Milvus has a highly decoupled architecture, it is fully open source and I had seen it in some examples in the NVIDIA/GenerativeAIExamples repo, so I decided to give it a try.

  • AdalFlow

    AdalFlow: The library to build & auto-optimize LLM applications.

    Project mention: Show HN: AdalFlow: The library to build and auto-optimize any LLM task pipeline | news.ycombinator.com | 2024-08-18
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python rag discussion

Log in or Post with

Python rag related posts

  • Top AI Search Engines for Business & Startups in 2025

    1 project | dev.to | 4 Dec 2024
  • Ask HN: Recommendation for a SWE looking to get up to speed with latest on AI

    5 projects | news.ycombinator.com | 27 Nov 2024
  • СontextCheck: LLM & RAG Evaluation Framework

    1 project | dev.to | 27 Nov 2024
  • 32k context length text embedding models

    4 projects | news.ycombinator.com | 23 Nov 2024
  • ContextCheck: An open-source framework for testing and evaluating LLMs, RAGs, Chatbots

    1 project | dev.to | 21 Nov 2024
  • ContextCheck: An open-source framework for testing and evaluating LLMs, RAGs, Chatbots

    1 project | dev.to | 21 Nov 2024
  • Show HN: We built open-source framework for testing LLMs, RAGs, Chatbots

    1 project | news.ycombinator.com | 21 Nov 2024
  • A note from our sponsor - SaaSHub
    www.saashub.com | 8 Dec 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source rag projects in Python? This list will help you:

Project Stars
1 llama_index 37,107
2 quivr 36,806
3 chatgpt-on-wechat 31,628
4 ragflow 24,382
5 mem0 23,196
6 kotaemon 17,722
7 khoj 16,776
8 DB-GPT 13,868
9 vanna 12,164
10 danswer 10,840
11 LightRAG 10,492
12 txtai 9,605
13 rags 6,334
14 TaskingAI 6,233
15 superduper 4,848
16 awesome-llm-apps 4,277
17 dataherald 3,366
18 cognita 3,355
19 llama_parse 3,290
20 langroid 2,732
21 gptme 2,679
22 GenerativeAIExamples 2,498
23 AdalFlow 2,280

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you konow that Python is
the 2nd most popular programming language
based on number of metions?