blog
ragas
blog | ragas | |
---|---|---|
3 | 10 | |
5 | 4,668 | |
- | 14.1% | |
9.5 | 9.6 | |
20 days ago | 5 days ago | |
HTML | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
blog
- Patterns for Building LLM-Based Systems and Products
-
Critical New 0-day Vulnerability in Popular Log4j Library - List of applications
Vespa ENGINE : https://github.com/vespa-engine/blog/blob/f281ce4399ed3e97b4fed32fcc36f9ba4b17b1e2/_posts/2021-12-10-log4j-vulnerability.md
ragas
-
Show HN: Ragas – the de facto open-source standard for evaluating RAG pipelines
congrats on launching! i think my continuing struggle with looking at Ragas as a company rather than an oss library is that the core of it is like 8 metrics (https://github.com/explodinggradients/ragas/tree/main/src/ra...) that are each 1-200 LOC. i can inline that easily in my app and retain full control, or model that in langchain or haystack or whatever.
why is Ragas a library and a company, rather than an overall "standard" or philosophy (eg like Heroku's 12 Factor Apps) that could maybe be more robust?
(just giving an opp to pitch some underappreciated benefits of using this library)
- FLaNK 04 March 2024
- FLaNK Stack 05 Feb 2024
-
SuperDuperDB - how to use it to talk to your documents locally using llama 7B or Mistral 7B?
Also, at some point you'll need to get serious about evaluation (trust me, you will). You may be interested in https://github.com/explodinggradients/ragas
- Ragas – Framework for RAG Evaluation
- Ragas: Open-source Evaluation framework for RAG pipelines
-
Building a customer support chatbot using GPT-3.5 and lLamaIndex🚀
The problem becomes worse if you want to inspect outputs from not just one, but several different queries. Luckily, there are several free open source packages such as ragas and DeepEval that can help evaluate your chatbot so you don't have to manually do it 😌
-
Patterns for Building LLM-Based Systems and Products
We have build RAGAS framework for this https://github.com/explodinggradients/ragas
-
[R] All about evaluating Large language models
Hi u/thecuteturtle, I am building open-source projects for evaluating LLM-based applications. Check it out https://github.com/explodinggradients/ragas and if you like to collaborate let me know :)
What are some alternatives?
signald
deepeval - The LLM Evaluation Framework
security-advisories - Security Advisories for the Jitsi projects
chameleon-llm - Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
agenta - The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
Local-LLM-Langchain - Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and KoboldAI versions of the langchain notebooks with examples.
OpenPipe - Turn expensive prompts into cheap fine-tuned models
FastLoRAChat - Instruct-tune LLaMA on consumer hardware with shareGPT data
llama_index - LlamaIndex is a data framework for your LLM applications
security-tools
text-generation-webui-colab - A colab gradio web UI for running Large Language Models