TruthfulQA: Measuring How Models Imitate Human Falsehoods
Why do you think that https://github.com/langchain-ai/auto-evaluator is a good alternative to TruthfulQA?