SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python rag Projects
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
quivr
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.
-
chatgpt-on-wechat
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
-
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
⭐️ RAG Flow on GitHub
-
Project mention: Show HN: Claude Memory – Long-term memory for Claude | news.ycombinator.com | 2024-09-05
-
Project mention: Show HN: Quilt (Chat with your documents using cutting-edge RAG) | news.ycombinator.com | 2024-10-07
E__m_c_2 4 days ago | next [–]
this is fork of https://github.com/Cinnamon/kotaemon Please attribute accordingly
-
khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claude, gemini, llama, qwen, mistral).
GitHub https://github.com/khoj-ai/khoj GitHub Star 12.4k GitHub Fork 627 GitHub Issue 64 GitHub Pull Request 3 GitHub Contributor 35 Open Source License AGPL-3.0 Official Website https://khoj.dev/ Documentation https://docs.khoj.dev/
-
DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
-
Essentially this solution is to let the AI *formulate the search* expression and not do the search itself (similar to the concept of generating a SQL statement instead of executing it https://github.com/vanna-ai/vanna).
-
Project mention: Show HN: Danswer APIs – Open-source APIs for building RAG apps over company docs | news.ycombinator.com | 2024-11-12
-
Project mention: LightRAG: Simple and Fast Retrieval-Augmented Generation | news.ycombinator.com | 2024-12-02
-
txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
I fully agree. Postgres has solved many of the problems that many are re-solving with GenAI related databases.
With txtai (https://github.com/neuml/txtai), I've went all in with Postgres + pgvector. Projects can start small with a SQLite backend then switch the persistence to Postgres. With this, you get all the years of battle-tested production experience from Postgres built-in for free.
-
Project mention: SuperDuperDB - how to use it to talk to your documents locally using llama 7B or Mistral 7B? | /r/LocalLLaMA | 2023-12-09
-
-
superduper
Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.
Project mention: Build fully portable AI applications on top of Snowflake with SuperDuperDB | dev.to | 2024-06-26Customize how AI and databases work together. Scale your AI projects to handle more data and users. Move AI projects between different environments easily. Extend the system with new AI features and database functionality. Check it out: Blog: https://blog.superduperdb.com/version-02 Github: https://github.com/SuperDuperDB/superduperdb (leave us a star ⭐️🥳)
-
awesome-llm-apps
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
Project mention: Ask HN: Recommendation for a SWE looking to get up to speed with latest on AI | news.ycombinator.com | 2024-11-27Unwind AI would be helpful. They publish daily newsletters on AI as well as tutorials on building apps with step-by-step walkthrough. Super focused on developers. https://www.theunwindai.com/
-
-
cognita
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
⭐️ Cognita on GitHub
-
Project mention: Adding an Amazon Bedrock Knowledge Base to the Forex Rate Assistant | dev.to | 2024-05-16
It's fair to think that undesirable artifacts and lack of structural context would impact search accuracy, performance, and ultimately cost. Consequently, it makes sense to perform some data pre-processing before passing the source documents to the RAG workflow. Third-party APIs and tools, such as LlamaParse and LayoutPDFReader, can help with pre-processing PDF data, however keep in mind that source documents may take any forms and there is no one-size-fits-all solution. You may have to resort to developing custom processes for pre-processing and search your unique data.
-
Project mention: Understanding the BM25 full text search algorithm | news.ycombinator.com | 2024-11-19
In the Langroid[1] LLM library we have a clean, extensible RAG implementation in the DocChatAgent[2] -- it uses several retrieval techniques, including lexical (bm25, fuzzy search) and semantic (embeddings), and re-ranking (using cross-encoder, reciprocal-rank-fusion) and also re-ranking for diversity and lost-in-the-middle mitigation:
[1] Langroid - a multi-agent LLM framework from CMU/UW-Madison researchers https://github.com/langroid/langroid
[2] DocChatAgent Implementation -
-
gptme
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.
Project mention: Show HN: OpenSource]Personal AI assistant in your terminal | news.ycombinator.com | 2024-10-09 -
GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Project mention: RedLM: My submission for the NVIDIA and LlamaIndex Developer Contest | dev.to | 2024-11-14There are a LOT of options to consider when picking a vector database for a RAG application. Milvus has a highly decoupled architecture, it is fully open source and I had seen it in some examples in the NVIDIA/GenerativeAIExamples repo, so I decided to give it a try.
-
Project mention: Show HN: AdalFlow: The library to build and auto-optimize any LLM task pipeline | news.ycombinator.com | 2024-08-18
Python rag discussion
Python rag related posts
-
Top AI Search Engines for Business & Startups in 2025
-
Ask HN: Recommendation for a SWE looking to get up to speed with latest on AI
-
СontextCheck: LLM & RAG Evaluation Framework
-
32k context length text embedding models
-
ContextCheck: An open-source framework for testing and evaluating LLMs, RAGs, Chatbots
-
ContextCheck: An open-source framework for testing and evaluating LLMs, RAGs, Chatbots
-
Show HN: We built open-source framework for testing LLMs, RAGs, Chatbots
-
A note from our sponsor - SaaSHub
www.saashub.com | 8 Dec 2024
Index
What are some of the best open-source rag projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | llama_index | 37,107 |
2 | quivr | 36,806 |
3 | chatgpt-on-wechat | 31,628 |
4 | ragflow | 24,382 |
5 | mem0 | 23,196 |
6 | kotaemon | 17,722 |
7 | khoj | 16,776 |
8 | DB-GPT | 13,868 |
9 | vanna | 12,164 |
10 | danswer | 10,840 |
11 | LightRAG | 10,492 |
12 | txtai | 9,605 |
13 | rags | 6,334 |
14 | TaskingAI | 6,233 |
15 | superduper | 4,848 |
16 | awesome-llm-apps | 4,277 |
17 | dataherald | 3,366 |
18 | cognita | 3,355 |
19 | llama_parse | 3,290 |
20 | langroid | 2,732 |
21 | gptme | 2,679 |
22 | GenerativeAIExamples | 2,498 |
23 | AdalFlow | 2,280 |