Practical_RL vs awesome-RLHF

Practical_RL

A course in reinforcement learning in the wild (by yandexdataschool)


awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated) (by opendilab)

Deep Learning deep-reinforcement-learning human-feedback reinforcement-learning rlhf large-language-models


Project	Practical_RL	awesome-RLHF
Mentions	2	6
Stars	5,716	2,739
Growth	1.2%	8.3%
Activity	6.0	7.0
Latest Commit	17 days ago	11 days ago
Language	Jupyter Notebook	-
License	The Unlicense	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
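
As a rough illustration of the two derived metrics described above, the sketch below computes month-over-month star growth and reads an activity score as a "top X%" figure. Both formulas and both function names are assumptions made for illustration: the page does not state its exact calculation, and the linear activity mapping is only chosen to agree with the single data point given (9.0 corresponds to the top 10%).

```python
def month_over_month_growth(stars_now: int, stars_month_ago: int) -> float:
    """Percentage change in stars over one month (assumed formula)."""
    if stars_month_ago <= 0:
        raise ValueError("stars_month_ago must be positive")
    return (stars_now - stars_month_ago) / stars_month_ago * 100.0


def top_percent_from_activity(activity: float) -> float:
    """Read an activity score on a 0-10 scale as 'top X%' of tracked projects.

    Assumes a linear mapping that matches the one stated example:
    an activity of 9.0 corresponds to the top 10%.
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity must be between 0 and 10")
    return (10.0 - activity) * 10.0


# Hypothetical star counts, chosen for illustration only.
print(round(month_over_month_growth(1012, 1000), 1))  # 1.2
print(top_percent_from_activity(9.0))                 # 10.0
```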

Practical_RL

Posts with mentions or reviews of Practical_RL. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-03-25.

[D] implementation of MCTS in Python
1 project | /r/MachineLearning | 4 Dec 2021
Alternatives to OpenAI’s spinning up?
2 projects | /r/reinforcementlearning | 25 Mar 2021

There is this great GitHub repo with lectures and other resources, plus week-by-week Jupyter notebooks where they explain and code, with homework at the very end. It covers the basics and deep RL, though only DQN and DDPG/PPO, but I think it will give you a good start in the topic before you start working on your own.

awesome-RLHF

Posts with mentions or reviews of awesome-RLHF. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-14.

OpenDILab Awesome Paper Collection: RL with Human Feedback （3）
2 projects | /r/u_OpenDILab | 14 May 2023

Recently, OpenDILab made a paper collection about Reinforcement Learning with Human Feedback (RLHF) and it has been open-sourced on GitHub. This repository is dedicated to helping researchers to collect the latest papers on RLHF, so that they can get to know this area better and more easily.
Awesome RLHF (RL with Human Feedback) This is a collection of research papers for Reinforcement Learning with Human Feedback (RLHF). And the repository will be continuously updated to track the frontier of RLHF. Welcome to follow and star! https://github.com/opendilab/awesome-RLHF
1 project | /r/chatgpt_newtech | 21 Apr 2023

Welcome to follow and star! https://github.com/opendilab/awesome-RLHF
OpenDILab Awesome Paper Collection: RL with Human Feedback （1）
2 projects | /r/reinforcementlearning | 19 Apr 2023

Here we introduce a new repository open-sourced by OpenDILab. Recently, OpenDILab made a paper collection about Reinforcement Learning with Human Feedback (RLHF) and open-sourced it on GitHub. This repository is dedicated to helping researchers collect the latest papers on RLHF, so that they can get to know this area better and more easily. About RLHF: Reinforcement Learning with Human Feedback (RLHF) is an extended branch of Reinforcement Learning (RL) that incorporates human feedback into the training process. When the optimization goal is abstract and a specific reward function is very difficult to define, this feedback can be used to build a reward model (a neural network) that provides reward signals for RL agents to learn from. In this way human needs, preferences, and perceptions can be communicated to the agent more naturally through interactive learning, aligning the optimization goals of humans and artificial intelligence to produce systems that behave in a manner consistent with human values.
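
The reward-model idea in the excerpt above can be sketched in a few lines. This is a toy under stated assumptions, not the method of any paper in the collection: the reward model is a linear function instead of a neural network, the human feedback is simulated as pairwise preferences, and the loss is a Bradley-Terry style pairwise logistic loss, which is a common choice but is not named in the post. The function names and the feature data are hypothetical, and the RL step that would consume the learned reward is omitted.

```python
import math
import random


def reward(weights, features):
    """Linear reward model: a stand-in for the reward neural network."""
    return sum(w * x for w, x in zip(weights, features))


def train_reward_model(preferences, dim, lr=0.1, epochs=200):
    """Fit weights so preferred items score higher than rejected ones.

    Each preference is a (preferred_features, rejected_features) pair.
    Minimizes -log(sigmoid(r_preferred - r_rejected)) by gradient descent.
    """
    weights = [0.0] * dim
    for _ in range(epochs):
        for preferred, rejected in preferences:
            margin = reward(weights, preferred) - reward(weights, rejected)
            # Negative gradient of the loss w.r.t. the margin: 1 - sigmoid(margin).
            scale = 1.0 - 1.0 / (1.0 + math.exp(-margin))
            for i in range(dim):
                weights[i] += lr * scale * (preferred[i] - rejected[i])
    return weights


# Simulated feedback (hypothetical): the annotator always prefers the
# response whose first feature is larger; the second feature is noise.
random.seed(0)
preferences = []
for _ in range(50):
    a = [random.random(), random.random()]
    b = [random.random(), random.random()]
    preferences.append((a, b) if a[0] > b[0] else (b, a))

weights = train_reward_model(preferences, dim=2)
print(weights)
```

In a full RLHF pipeline the learned `reward` function would replace a hand-written reward signal when training the agent's policy.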
A collection of research papers for Reinforcement Learning with Human Feedback (RLHF)
1 project | /r/GPT3 | 27 Mar 2023

1 project | /r/reinforcementlearning | 16 Mar 2023

1 project | /r/ChatGPT | 16 Mar 2023

What are some alternatives?

When comparing Practical_RL and awesome-RLHF you can also consider the following projects:

webdataset - A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

LaMDA-rlhf-pytorch - Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

FunMatch-Distillation - TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.

deep-learning-drizzle - Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

awesome-rl - Reinforcement learning resources curated

hh-rlhf - Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

labml - 🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱

visual-chatgpt - Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models [Moved to: https://github.com/microsoft/TaskMatrix]

alpha-zero-general - A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

redisai-examples - RedisAI showcase

TensorFlow-Tutorials - TensorFlow Tutorials with YouTube Videos

YPDL-Build-a-movie-recommendation-engine-with-TensorFlow - In this tutorial, we are going to build a Restricted Boltzmann Machine using TensorFlow that will give us recommendations based on movies that have been watched already. The datasets we are going to use are acquired from GroupLens and contains movies, users, and movie ratings by these users.
