Show HN: R2R – Open-source framework for production-grade RAG

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • R2R

    The framework for fast development and deployment of RAG systems.

  • You mentioned "Generate example queries", there is already an example that shows how to generate and search over synthetic queries w/ minor tweaks to the basic pipeline [https://github.com/SciPhi-AI/R2R/blob/main/examples/academy/...].

    I think the other other approaches you outline are all worth investigating as well. There is definitely a tension we face between building and testing new experimental approaches vs. figuring out what features people need in production and implementing those.

    Just so you know where we are heading - we want to make sure all the features are there for easy experimentation, but we also want to provide value into production and beyond. As an example, we are currently working on robust task orchestration to accompany our pipeline abstractions to help with ingesting large quantities of data, as this has been a painpoint in our own experience and that of some of our early enterprise users.

  • hatchet

    A distributed, fault-tolerant task queue

  • This is a great question, thanks for asking.

    We are testing workflows internally that use orchestration software like Hatchet/Temporal to allow the framework to robustly handle 100s of GBs of upload data from parsing to chunking to embedding to storing [1][2]. The goal is to build durable execution at each step, because even steps like PDF extraction can be expensive / time consuming. We are targeting an prelim. release of these features in < 1 month.

    Logging is built natively into the framework with postgres or sqlite options. We ship a GUI that leverages these logs and the application flow to allow developers to see queries, search results, and RAG completions in realtime.

    We are planning on adding more features here to help with evaluation / insight as we get further feedback.

    On the A/B, slow rollout, and analytics side, we are still early but suspect there is a lot of value to be had here, particularly because human feedback is pretty crucial in optimizing any RAG system. Developer feedback will be particularly important here since there are a lot of paths to choose between.

    [1] https://hatchet.run/

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • canopy

    Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Build a simple RAG chatbot with LangChain...

    2 projects | dev.to | 17 May 2024
  • ChatGPT provides false information about people, and OpenAI can't correct it

    1 project | news.ycombinator.com | 29 Apr 2024
  • Using Vector Embeddings to Overengineer 404 pages

    1 project | dev.to | 17 Jan 2024
  • Pinecone: Build Knowledgeable AI

    1 project | news.ycombinator.com | 16 Jan 2024
  • Canopy is an open-source Retrieval Augmented Generation (RAG) framework

    2 projects | news.ycombinator.com | 7 Dec 2023