safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback (by PKU-Alignment)

Safe-rlhf Alternatives

Similar projects and alternatives to safe-rlhf

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better safe-rlhf alternative or higher similarity.

safe-rlhf reviews and mentions

Posts with mentions or reviews of safe-rlhf. We have used some of these posts to build our list of alternatives and similar projects.

Stats

Basic safe-rlhf repo stats
1
1,149
8.3
7 days ago

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com