Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Why do you think that https://github.com/f/awesome-chatgpt-prompts is a good alternative to TextRL