Code for the paper "Evaluating Large Language Models Trained on Code"
Why do you think that https://github.com/nlpxucan/WizardLM is a good alternative to human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
Why do you think that https://github.com/nlpxucan/WizardLM is a good alternative to human-eval