hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" (by anthropics)

hh-rlhf Alternatives

Similar projects and alternatives to hh-rlhf

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better hh-rlhf alternative or a more similar project.

hh-rlhf reviews and mentions

Posts with mentions or reviews of hh-rlhf. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-14.

Stats

Basic hh-rlhf repo stats
6
1,441
3.6
8 months ago
