Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
Why do you think that https://github.com/teknium1/GPTeacher is a good alternative to llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
Why do you think that https://github.com/teknium1/GPTeacher is a good alternative to llm-jeopardy