-
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
-
hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
Recently, OpenDILab released a paper collection on Reinforcement Learning with Human Feedback (RLHF), open-sourced on GitHub. The repository is dedicated to helping researchers collect the latest RLHF papers so they can get to know the area more easily.
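As background for the collection above: a common first step in the RLHF pipeline is training a reward model on human preference pairs (such as those in hh-rlhf). The following is a minimal sketch, not taken from any of the listed repositories, of the pairwise Bradley-Terry objective: given reward scores for a "chosen" and a "rejected" response, minimize the negative log-sigmoid of their difference. The function name `preference_loss` is illustrative only.

```python
import math

def preference_loss(r_chosen: float, r_rejected: float) -> float:
    """Bradley-Terry negative log-likelihood for one preference pair.

    The loss is small when the chosen response is scored well above
    the rejected one, and equals log(2) when the two scores are tied.
    """
    margin = r_chosen - r_rejected
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# A larger margin between chosen and rejected yields a smaller loss.
print(preference_loss(2.0, 0.0) < preference_loss(0.5, 0.0))
```

In practice the scalar rewards come from a learned model scoring full prompt-response pairs, and the loss is averaged over a batch of preference comparisons.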
Title: Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Related posts
-
Awesome RLHF (RL with Human Feedback): a collection of research papers on Reinforcement Learning with Human Feedback (RLHF). The repository is continuously updated to track the frontier of RLHF. Welcome to follow and star! https://github.com/opendilab/awesome-RLHF
-
OpenDILab Awesome Paper Collection: RL with Human Feedback (1)
-
A collection of research papers for Reinforcement Learning with Human Feedback (RLHF)
-
Deep Reinforcement Learning: Zero to Hero
-
How do I change the maximum number of steps for training