Show HN: Ragas – the de facto open-source standard for evaluating RAG pipelines

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • ragas

    Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

  • Ragas is an open-source library for evaluating and testing RAG and other LLM applications. It provides metrics and methods, such as synthetic test data generation, to help you evaluate your RAG applications. Ragas started last year as a way to scratch our own itch: evaluating our RAG chatbots.

    ### Problems Ragas Can Solve

    - How do you choose the best components for your RAG, such as the retriever, reranker, and LLM?

    - How do you formulate a test dataset without spending tons of money and time?

    We believe there needs to be an open-source standard for evaluating and testing LLM applications, and our vision is to build it for the community. We are tackling this challenge by adapting ideas from the traditional ML lifecycle to LLM applications.

    #### ML Testing Evolved for LLM Applications

    We built Ragas on the principles of metrics-driven development and aim to develop and innovate techniques inspired by state-of-the-art research to solve the problems in evaluating and testing LLM applications.
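
    As a rough illustration of metrics-driven development, the sketch below scores two candidate retrievers on a tiny labeled test set and keeps the one with the higher score. It does not use the Ragas API: the corpus, test set, retrievers, and the `recall_at_k` helper are all hypothetical stand-ins invented for this example.

```python
import re
from typing import Callable, Dict, List, Set

# Hypothetical toy corpus and labeled test set (not from Ragas).
CORPUS: Dict[str, str] = {
    "d1": "Ragas is an open-source library for evaluating RAG pipelines.",
    "d2": "Vector databases store embeddings for similarity search.",
    "d3": "Synthetic test data generation reduces manual labeling cost.",
}

TEST_SET = [
    {"question": "What is Ragas used for?", "relevant": {"d1"}},
    {"question": "How can synthetic test data be generated?", "relevant": {"d3"}},
]


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and return its set of alphabetic tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def keyword_retriever(question: str, k: int = 1) -> List[str]:
    """Rank documents by token overlap with the question (ties broken by id)."""
    q = tokenize(question)
    ranked = sorted(CORPUS, key=lambda d: (-len(q & tokenize(CORPUS[d])), d))
    return ranked[:k]


def first_k_retriever(question: str, k: int = 1) -> List[str]:
    """Naive baseline: ignore the question and return the first k document ids."""
    return sorted(CORPUS)[:k]


def recall_at_k(retriever: Callable[[str, int], List[str]], k: int = 1) -> float:
    """Average fraction of relevant documents found in the top-k results."""
    per_question = []
    for example in TEST_SET:
        retrieved = set(retriever(example["question"], k))
        relevant = example["relevant"]
        per_question.append(len(retrieved & relevant) / len(relevant))
    return sum(per_question) / len(per_question)


# Score each candidate component with the same metric, then pick the best.
candidates = {"keyword": keyword_retriever, "first_k": first_k_retriever}
scores = {name: recall_at_k(fn, k=1) for name, fn in candidates.items()}
best = max(scores, key=scores.get)
print(scores, "->", best)
```

    The same loop extends to rerankers or LLMs: hold the test set and metric fixed, vary one component, and compare the scores.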

    We don't believe that the problem of evaluating and testing applications can be solved by building a fancy tracing tool; rather, we want to solve the problem at a layer further down the stack. To that end, we are introducing methods such as automated synthetic test data curation, metrics, and feedback utilisation, inspired by lessons learned from deploying stochastic models in our careers as ML engineers.

    While Ragas currently focuses on RAG pipelines, our goal is to extend it to test a wide array of compound systems, including those based on RAGs, agentic workflows, and various transformations.

    #### Try ragas

    Try out Ragas in Google Colab: https://colab.research.google.com/github/shahules786/openai-... Read our docs at https://docs.ragas.io/ to learn more.

    We would love to hear feedback from the HN community :)

  • deepeval

    The LLM Evaluation Framework

  • This is nice; open-source LLM evaluation libraries are coming out more and more often.

    We're using DeepEval (https://github.com/confident-ai/deepeval) currently. How is this different from that?

