Metric | cramming | extreme-bert
---|---|---
Mentions | 6 | 2
Stars | 1,238 | 283
Growth | - | 0.0%
Activity | 7.3 | 0.0
Last commit | 16 days ago | about 1 year ago
Language | Python | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
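The definitions above can be made concrete with a small sketch. The site does not publish its exact formulas, so everything here is an assumption for illustration: the function names, the exponential decay with a 30-day half-life used to weight recent commits, and the mapping of a raw score to a 0-10 scale by the share of tracked projects that score lower.

```python
from bisect import bisect_left


def month_over_month_growth(stars_prev: int, stars_now: int) -> float:
    """Percentage change in stars between two consecutive monthly snapshots."""
    if stars_prev <= 0:
        raise ValueError("previous star count must be positive")
    return 100.0 * (stars_now - stars_prev) / stars_prev


def recency_weighted_commits(commit_ages_days, half_life_days=30.0):
    """Sum of per-commit weights; a commit's weight halves every half_life_days.

    The decay shape and the half-life are assumptions, not the site's formula.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(raw, all_raw):
    """Map a raw score to 0-10 by the fraction of tracked projects strictly below it."""
    ranked = sorted(all_raw)
    below = bisect_left(ranked, raw)
    return round(10.0 * below / len(ranked), 1)
```

Under these assumptions, a project whose raw score exceeds that of 90 out of 100 tracked projects gets an activity of 9.0, which matches the "top 10%" reading above, and a project whose star count is unchanged from one month to the next (for example 283 and 283) shows a growth of 0.0%.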
cramming
- [P] Notes on training BERT from scratch on an 8GB consumer GPU
- Cramming the training of a (BERT-type) language model into limited compute
- NanoGPT
- New AI Research from the University of Maryland Investigates Cramming Challenge for Training a Language Model on a Single GPU in One Day
  - Quick Read: https://www.marktechpost.com/2023/01/03/new-ai-research-from-the-university-of-maryland-investigates-cramming-challenge-for-training-a-language-model-on-a-single-gpu-in-one-day/
  - Paper: https://arxiv.org/pdf/2212.14034.pdf
  - GitHub: https://github.com/JonasGeiping/cramming
- Lucas Beyer on Twitter: “How Good of a Bert Can One Get in One Day on One GPU?”
- Cramming: Training a Language Model on a Single GPU in One Day - Jonas Geiping and Tom Goldstein, University of Maryland, 2022
  - GitHub: https://github.com/JonasGeiping/cramming
extreme-bert
- [P] Releasing customized language model pre-training acceleration toolkit: ExtremeBERT
  - Found relevant code at https://github.com/extreme-bert/extreme-bert
What are some alternatives?
- nanoGPT - The simplest, fastest repository for training/finetuning medium-sized GPTs.
- torchscale - Foundation Architecture for (M)LLMs
- askai - Command Line Interface for OpenAI ChatGPT
- primeqa - The prime repository for state-of-the-art Multilingual Question Answering research and development.
- english-lang - The English Programming Language
- pixel - Research code for pixel-based encoders of language (PIXEL)
- aitextgen - A robust Python tool for text-based AI training and generation using GPT-2.
- bertviz - BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
- flan-ul2-alpaca
- RATransformers - RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!
- askai - Your simple terminal helper - A CLI integration with OpenAI's GPT3
- transformers - 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.