PaLM-rlhf-pytorch VS trlx

Compare PaLM-rlhf-pytorch vs trlx and see what are their differences.

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM (by lucidrains)

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) (by CarperAI)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
PaLM-rlhf-pytorch trlx
25 5
7,593 4,324
- 1.1%
4.6 7.9
4 months ago 4 months ago
Python Python
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

PaLM-rlhf-pytorch

Posts with mentions or reviews of PaLM-rlhf-pytorch. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-18.

trlx

Posts with mentions or reviews of trlx. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-09.

What are some alternatives?

When comparing PaLM-rlhf-pytorch and trlx you can also consider the following projects:

nanoGPT - The simplest, fastest repository for training/finetuning medium-sized GPTs.

alpaca-lora - Instruct-tune LLaMA on consumer hardware

GLM-130B - GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

trl - Train transformer language models with reinforcement learning.

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

RL4LMs - A modular RL library to fine-tune language models to human preferences

ggml - Tensor library for machine learning

Open-Assistant - OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

summarize-from-feedback - Code for "Learning to summarize from human feedback"

Rath - Next generation of automated data exploratory analysis and visualization platform.

gigagan-pytorch - Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs