Suggest an alternative to

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Why do you think that https://github.com/nlpxucan/WizardLM is a good alternative to human-eval