A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)