Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Why do you think that https://github.com/my-other-github-account/llm-humaneval-ben is a good alternative to BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Why do you think that https://github.com/my-other-github-account/llm-humaneval-ben is a good alternative to BIG-Bench-Hard