A framework for few-shot evaluation of language models.
Why do you think that https://github.com/deepset-ai/haystack is a good alternative to lm-evaluation-harness
A framework for few-shot evaluation of language models.
Why do you think that https://github.com/deepset-ai/haystack is a good alternative to lm-evaluation-harness