gpt-2 VS gpt-2-training

Compare gpt-2 vs gpt-2-training and see what are their differences.

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners" (by openai)

gpt-2-training

Training GPT-2 on a Russian language corpus (by l4rz)
Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
gpt-2 gpt-2-training
63 1
21,111 85
1.9% -
2.5 1.8
19 days ago over 3 years ago
Python Python
GNU General Public License v3.0 or later -
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

gpt-2

Posts with mentions or reviews of gpt-2. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-20.

gpt-2-training

Posts with mentions or reviews of gpt-2-training. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-04-29.
  • NEED HELP IN GPT 2
    3 projects | /r/programminghelp | 29 Apr 2021
    ok so this is one part of it the next is https://github.com/l4rz/gpt-2-training and this https://github.com/openai/gpt-2 i had to change some stuff to get the 345m perimeter model

What are some alternatives?

When comparing gpt-2 and gpt-2-training you can also consider the following projects:

dalle-mini - DALL·E Mini - Generate images from a text prompt

minGPT - A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time

gpt-neo - An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

sentencepiece - Unsupervised text tokenizer for Neural Network-based text generation.

jukebox - Code for the paper "Jukebox: A Generative Model for Music"

mesh-transformer-jax - Model parallel transformers in JAX and Haiku

gpt-neox - An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

stylegan2-pytorch - Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement

dalle-2-preview

tensorrtx - Implementation of popular deep learning networks with TensorRT network definition API