PaLM-rlhf-pytorch VS RL4LMs

Compare PaLM-rlhf-pytorch vs RL4LMs and see what are their differences.

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM (by lucidrains)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
PaLM-rlhf-pytorch RL4LMs
25 5
7,593 2,094
- 2.2%
4.6 0.0
4 months ago 2 months ago
Python Python
MIT License Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

PaLM-rlhf-pytorch

Posts with mentions or reviews of PaLM-rlhf-pytorch. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-18.

RL4LMs

Posts with mentions or reviews of RL4LMs. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-12.

What are some alternatives?

When comparing PaLM-rlhf-pytorch and RL4LMs you can also consider the following projects:

nanoGPT - The simplest, fastest repository for training/finetuning medium-sized GPTs.

trlx - A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

GLM-130B - GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Dromedary - Dromedary: towards helpful, ethical and reliable LLMs.

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

valhalla-nmt - Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"

ggml - Tensor library for machine learning

Open-Assistant - OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

awesome-transformer-nlp - A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.

NeMo-Guardrails - NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.