finetune-gpt2xl vs aitg
| | finetune-gpt2xl | aitg |
|---|---|---|
| Mentions | 9 | 1 |
| Stars | 421 | 4 |
| Growth | - | - |
| Activity | 0.0 | 0.0 |
| Latest commit | 11 months ago | over 1 year ago |
| Language | Python | Python |
| License | MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
finetune-gpt2xl
- Fine-tuning?
git clone the finetuning repo (https://github.com/Xirider/finetune-gpt2xl), go into the repo, and install the rest of the requirements: pip install -r requirements.txt
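After the install, a quick sanity check that the environment is ready can save a failed training run later. A minimal sketch, assuming the repo's requirements pull in transformers and deepspeed (which the guide relies on):

```python
# Quick environment check after `pip install -r requirements.txt`.
# Assumes transformers and deepspeed are in the repo's requirements;
# versions are not pinned here.
import torch
import transformers
import deepspeed

print("transformers", transformers.__version__)
print("deepspeed", deepspeed.__version__)
print("CUDA available:", torch.cuda.is_available())
```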
- Training text-generating models locally
- Dataset For GPT Fine-Tuning
I would like to understand a little better how to organize texts for fine-tuning, especially for GPT-Neo. I plan to use the procedure from this repo, which includes the following notice…
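On organizing texts: one common convention for GPT-style causal LM fine-tuning is to join separate documents with the model's end-of-text token so the model learns document boundaries. A minimal sketch of that convention; the file name is illustrative and the repo's own preprocessing step may differ:

```python
# Sketch: pack separate documents into one training file, separated by
# the end-of-text token used by GPT-2 and GPT-Neo. "train.txt" is an
# illustrative name, not necessarily the repo's expected input format.
documents = [
    "First article text ...",
    "Second article text ...",
]

EOT = "<|endoftext|>"

with open("train.txt", "w", encoding="utf-8") as f:
    f.write(EOT.join(documents))
```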
- How to share the finetuned model
The code suggested in the video (and in the repo) uses the --fp16 flag. But the "DeepSpeed Integration" article says that…
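The point of friction here is that fp16 is set in two places, the Trainer arguments and the DeepSpeed config, and the two must agree. A minimal sketch of that pairing; the config values are illustrative, not the repo's exact ds_config.json:

```python
from transformers import TrainingArguments

# Illustrative DeepSpeed config with fp16 enabled. "auto" lets
# DeepSpeed inherit the value from the Trainer arguments, which
# avoids the two settings drifting apart.
ds_config = {
    "fp16": {"enabled": "auto"},
    "zero_optimization": {"stage": 2},
    "train_batch_size": "auto",
    "train_micro_batch_size_per_gpu": "auto",
}

args = TrainingArguments(
    output_dir="finetuned-gpt2xl",  # assumed output directory
    fp16=True,                      # corresponds to the --fp16 flag
    deepspeed=ds_config,            # accepts a dict or a JSON file path
)
```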
- [D] I made a script that does all the work to deploy GPT-NEO on Windows 10. (Please Test)
- [Project] Estimating fine-tuning cost
Finetuning GPT-Neo 2.7B on WikiText (180 MB) took me about 45 minutes on one preemptible V100 instance on Google Cloud. It cost $1.30 per hour, so around $1 in total. Here are the steps: https://github.com/Xirider/finetune-gpt2xl
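The cost figure follows directly from the runtime and the hourly rate; a quick check of the arithmetic:

```python
# Back-of-the-envelope cost check for the numbers quoted above.
hours = 45 / 60        # ~45 minutes of training
rate_per_hour = 1.30   # preemptible V100 price quoted above, in USD
print(f"~${hours * rate_per_hour:.2f}")  # roughly $0.98, i.e. around $1
```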
- [P] Guide: Finetune GPT2-XL (1.5 Billion Parameters, the biggest model) on a single 16 GB VRAM V100 Google Cloud instance with Huggingface Transformers using DeepSpeed
Here I explain the setup and the commands to get it running: https://github.com/Xirider/finetune-gpt2xl
- Guide: Finetune GPT2-XL (1.5 Billion Parameters, the biggest model) on a single 16 GB VRAM V100 Google Cloud instance with Huggingface Transformers using DeepSpeed
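Once a checkpoint comes out of a run like the one in the guide, using it for generation is plain Transformers. A minimal sketch; "finetuned-gpt2xl" is an assumed output directory, not a name fixed by the repo:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# "finetuned-gpt2xl" is an assumed checkpoint directory produced by
# a fine-tuning run; substitute your own output path.
tokenizer = AutoTokenizer.from_pretrained("finetuned-gpt2xl")
model = AutoModelForCausalLM.from_pretrained("finetuned-gpt2xl")

inputs = tokenizer("The meaning of life is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50,
                         do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```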
aitg
What are some alternatives?
detoxify - Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at [email protected].
nvc-gpt3-chat - This is the code I used to create a small private SMS text chat system that employs GPT3 from OpenAI and "nonviolent communication", an algorithmically based method of conflict resolution. Hopefully the chat helps users process conflict.
Extracting-Training-Data-from-Large-Langauge-Models - A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020
ALAE - [CVPR2020] Adversarial Latent Autoencoders
bertviz - BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
jukebox - Code for the paper "Jukebox: A Generative Model for Music"
kogpt - KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)