A modular RL library to fine-tune language models to human preferences
Why do you think that https://github.com/hpcaitech/ColossalAI is a good alternative to RL4LMs
A modular RL library to fine-tune language models to human preferences
Why do you think that https://github.com/hpcaitech/ColossalAI is a good alternative to RL4LMs