Code for the paper Fine-Tuning Language Models from Human Preferences
Why do you think https://github.com/huggingface/trl is a good alternative to lm-human-preferences?