lm-human-preferences vs gpt-2

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences (by openai)

Suggest topics

Source Code

openai.com

Suggest alternative

Edit details

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners" (by openai)

Paper

Source Code

openai.com

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

lm-human-preferences		gpt-2
	Project
8	Mentions	63
1,106	Stars	21,111
5.3%	Growth	1.9%
2.7	Activity	2.5
9 months ago	Latest Commit	16 days ago
Python	Language	Python
MIT License	License	GNU General Public License v3.0 or later

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

lm-human-preferences

Posts with mentions or reviews of lm-human-preferences. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-14.

Ask HN: Open-source GPT-3 alternatives
4 projects | news.ycombinator.com | 14 Feb 2023
El éxito continuo de OpenAI: Y como llegaron a crear la IA más avanzada del 2023. ChatGPT.
2 projects | dev.to | 2 Feb 2023
Sam Altman on the best and worst case scenario for AI - "...the good case is just so unbelievably good that you sound like a really crazy person to start talking about it."
1 project | /r/singularity | 21 Jan 2023

Lest you think that that sounds like a too galaxy-brained possibility, it has already happened at OpenAI (scroll down to "Bugs can optimize for bad behavior"), just with a model that was very far from being capable enough to be dangerous.
Value head in GPT2
2 projects | /r/reinforcementlearning | 4 Oct 2022

Found relevant code at https://github.com/openai/lm-human-preferences + all code implementations here
Should we stick to the devil we know?
2 projects | /r/TheMotte | 11 Aug 2022

That's why, when they're serious, they use RL for finetuning from human preferences (would be hilarious if this attempt to solve the terrible bias you take to be evidence of AGI threat ends up creating a Woke Singleton itself, btw); it's a powerful general approach, and I see no sign of it being applied here.
Dall-E 2
16 projects | news.ycombinator.com | 6 Apr 2022

The kind of measures they are taking, like simply deleting wholesale anything problematic, don't really have a '-1'.
But amusingly, exactly that did happen in one of their GPT experiments! https://openai.com/blog/fine-tuning-gpt-2/
Discussion Thread
1 project | /r/neoliberal | 5 Apr 2022
[D] Applications for using reinforcement learning to fine-tune GPT-2
2 projects | /r/MachineLearning | 19 Mar 2022

Code for https://arxiv.org/abs/1909.08593 found: https://github.com/openai/lm-human-preferences

gpt-2

Posts with mentions or reviews of gpt-2. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-20.

Sam Altman is still trying to return as OpenAI CEO
2 projects | news.ycombinator.com | 20 Nov 2023
Build Personal ChatGPT Using Your Data
14 projects | news.ycombinator.com | 8 Jul 2023
Are the recent advancements in AI technology primarily driven by recent discoveries or the progress in hardware capabilities and the abundance of available data?
1 project | /r/ArtificialInteligence | 3 Jun 2023

"Our model, called GPT-2 (a successor to GPT), was trained simply to predict the next word in 40GB of Internet text. Due to our concerns about malicious applications of the technology, we are not releasing the trained model. As an experiment in responsible disclosure, we are instead releasing a much smaller model for researchers to experiment with, as well as a technical paper. "
BING IS NOW THE DEFAULT SEARCH FOR CHATGPT
1 project | /r/ChatGPT | 24 May 2023

They did release GPT-2 under the MIT License.
Don Knuth Plays with ChatGPT
6 projects | news.ycombinator.com | 20 May 2023

Did you arrive at this certainty through reading something other than what OpenAI has published? The document [0] that describes the training data for GPT-2 makes this assertion hilarious to me.
[0]: https://github.com/openai/gpt-2/blob/master/model_card.md#da...
Was frustriert euch an der Nutzung oder der Diskussion um KI?
1 project | /r/KI_Welt | 17 May 2023
The AI
1 project | /r/ProgrammerHumor | 11 Apr 2023
Help with pet project to learn - Running ChatGPT-2 at home
1 project | /r/learnmachinelearning | 11 Apr 2023

I made a clone of https://github.com/openai/gpt-2 on my local laptop
По поводу опасности ИИ и предложений остановить разработки на 6 месяцев.
3 projects | /r/tjournal_refugees | 30 Mar 2023
Elon Musk, Y Bengio, Andrew Yang etc called for a temporary pause on training systems exceeding GPT-4
1 project | /r/ChatGPT | 29 Mar 2023

Elon's 100M put this in the arena. https://github.com/openai/gpt-2

What are some alternatives?

When comparing lm-human-preferences and gpt-2 you can also consider the following projects:

trl - Train transformer language models with reinforcement learning.

dalle-mini - DALL·E Mini - Generate images from a text prompt

GLM-130B - GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

minGPT - A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time

tensorrtx - Implementation of popular deep learning networks with TensorRT network definition API

gpt-neo - An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

glide-text2im - GLIDE: a diffusion-based text-conditional image synthesis model

sentencepiece - Unsupervised text tokenizer for Neural Network-based text generation.

dalle-2-preview

jukebox - Code for the paper "Jukebox: A Generative Model for Music"

lm-human-preferences vs trl gpt-2 vs dalle-mini lm-human-preferences vs GLM-130B gpt-2 vs minGPT lm-human-preferences vs dalle-mini gpt-2 vs Real-Time-Voice-Cloning lm-human-preferences vs tensorrtx gpt-2 vs gpt-neo lm-human-preferences vs glide-text2im gpt-2 vs sentencepiece lm-human-preferences vs dalle-2-preview gpt-2 vs jukebox

Compare lm-human-preferences vs gpt-2 and see what are their differences.

lm-human-preferences

gpt-2

lm-human-preferences

gpt-2

What are some alternatives?