llama-hub
(Discontinued) A library of data loaders for LLMs made by the community, to be used with LlamaIndex and/or LangChain.
RAG is a very useful flow, but I agree the complexity is often overwhelming, especially as you move from a toy example to a real production deployment. It's not just choosing a vector DB (last time I checked there were about 50), managing it, and deciding how to chunk data. You also need to keep your retrieval pipeline accurate and fast, keep data secure and private, and manage the whole thing as it scales. That's one of the main benefits of using Vectara (https://vectara.com; FD: I work there): it's a GenAI platform that abstracts all this complexity away so you can focus on building your application.
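To make the chunking decision concrete, the simplest baseline is a fixed-size sliding window over the text. A minimal sketch (the size and overlap values here are arbitrary, and real pipelines usually split on sentence or token boundaries instead):

```python
def chunk_text(text: str, size: int = 500, overlap: int = 100) -> list[str]:
    """Split text into overlapping fixed-size character windows."""
    if size <= overlap:
        raise ValueError("size must exceed overlap")
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap  # step forward, keeping `overlap` chars of context
    return chunks
```

The overlap keeps sentences that straddle a boundary retrievable from at least one chunk, at the cost of some index bloat.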
For local stuff with a handful of documents, you can even just throw them into a JSON file and call it a day. The similarity search is as simple as an np.dot: https://github.com/gsuuon/llm.nvim/blob/main/python3/store.p...
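For anyone curious what that looks like, here's a rough sketch of the idea (not the actual llm.nvim code; the records and vectors are made up, and it assumes the embeddings are already unit-normalized so a dot product is a cosine similarity):

```python
import numpy as np

# Toy "store": a list of {"text": ..., "vector": ...} records, as might be
# dumped to a JSON file. The vectors here are invented for illustration.
store = [
    {"text": "doc about cats", "vector": [1.0, 0.0, 0.0]},
    {"text": "doc about dogs", "vector": [0.0, 1.0, 0.0]},
    {"text": "doc about birds", "vector": [0.0, 0.0, 1.0]},
]

def top_k(query_vec, store, k=2):
    """Return the texts of the k most similar records to the query vector."""
    mat = np.array([rec["vector"] for rec in store])
    scores = mat @ np.array(query_vec)  # one dot product scores every record
    order = np.argsort(scores)[::-1][:k]  # highest scores first
    return [store[i]["text"] for i in order]
```

At a few thousand documents this brute-force scan is plenty fast; a dedicated vector DB only starts to pay off well beyond that.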
Kudos to the team for a very detailed notebook covering things like pipeline evaluation with respect to performance, costs, etc. Even if we ignore the framework-specific bits, it is a great guide to follow when building RAG systems in production.
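As a taste of what such an evaluation involves, retrieval quality is often measured with recall@k over a small labeled query set. A minimal sketch (the metric choice is mine, not necessarily the notebook's):

```python
def recall_at_k(retrieved: list[str], relevant: set[str], k: int) -> float:
    """Fraction of the relevant documents that appear in the top-k results."""
    if not relevant:
        return 0.0
    hits = sum(1 for doc in retrieved[:k] if doc in relevant)
    return hits / len(relevant)
```

Averaging this over a set of labeled queries gives a cheap regression test for chunking or embedding changes before any cost measurements come into play.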
We have been building RAG systems in production for a few months and have been tinkering with different strategies to get the most performance out of these pipelines. As others have pointed out, a vector database may not be the right strategy for every problem. Similarly, there are issues like the "lost in the middle" problem (https://arxiv.org/abs/2307.03172) that one may have to deal with. We put together our learnings from building and optimizing these pipelines in a post at https://llmstack.ai/blog/retrieval-augmented-generation.
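One common mitigation for lost-in-the-middle is reordering retrieved chunks so the highest-ranked ones land at the edges of the prompt, where models attend best. A sketch of that idea (my illustration, not LLMStack's implementation):

```python
def reorder_for_context(docs_ranked: list[str]) -> list[str]:
    """Given docs sorted best-first, place the strongest docs at the start
    and end of the context, pushing the weakest toward the middle."""
    front, back = [], []
    for i, doc in enumerate(docs_ranked):
        (front if i % 2 == 0 else back).append(doc)
    return front + back[::-1]
```

The top two results end up first and last in the assembled context, so the model's weaker attention to the middle falls on the least relevant chunks.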
https://github.com/trypromptly/LLMStack is a low-code platform we open-sourced recently that ships these RAG pipelines out of the box with some app templates if anyone wants to try them out.
My favorite example is the asana loader[0] for llama-index. It's literally just the most basic wrapper around the Asana SDK to concatenate some strings.
[0] - https://github.com/emptycrown/llama-hub/blob/main/llama_hub/...
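For a sense of scale, the whole pattern boils down to something like this (a hypothetical sketch, not the real loader; `tasks` stands in for whatever the Asana SDK returns):

```python
def load_tasks_as_documents(tasks):
    """Hypothetical loader: concatenate a few fields from each API
    record into one text blob per 'document'."""
    docs = []
    for task in tasks:  # each `task` is assumed to be a dict-like API record
        docs.append(task.get("name", "") + "\n" + task.get("notes", ""))
    return docs
```

That's the whole "integration": fetch records, join a couple of string fields, hand the result to the indexer.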
This is already a feature in many commercial products, as well as in open-source libraries like PyOD: https://github.com/yzhao062/pyod
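The underlying idea is straightforward; here is a toy stand-in for PyOD's far more capable detectors, flagging points whose z-score exceeds a threshold (3 is a common but arbitrary cutoff):

```python
import numpy as np

def zscore_outliers(data, threshold: float = 3.0):
    """Return indices of points more than `threshold` std devs from the mean."""
    x = np.asarray(data, dtype=float)
    z = (x - x.mean()) / x.std()
    return np.flatnonzero(np.abs(z) > threshold)
```

Libraries like PyOD earn their keep on the hard cases this ignores: multivariate data, skewed distributions, and masking effects when several outliers pull the mean toward themselves.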