Top 23 Python gpt-2 Projects
-
RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with strong LLM performance that can also be trained directly like a GPT transformer (parallelizable). The current version is RWKV-7 "Goose". It combines the best of RNNs and transformers: strong performance, linear time, constant space (no KV cache), fast training, infinite ctx_len, and free sentence embedding.
-
mcdse-2b is trained from MrLight/dse-qwen2-2b-mrl-v1 using low-rank adapters (LoRA) on a multilingual corpus of documents. I have trained it on 8xRTX3090 using the DSE approach with the following parameters:
-
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
-
picoGPT is a simplified and minimal implementation of the GPT model. It demonstrates the core principles of the GPT architecture without the requirement for a full-scale implementation. Written in Python and consisting of a small quantity of code, picoGPT doesn't implement many of the optimisations and enhancements found in comprehensive implementations. In this short article, we'll convert the original Python code to a Jupyter notebook and test it in the SingleStore portal.
-
xTuring
Build, customize, and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our Discord community: https://discord.gg/TgHXuSJEk6
-
TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning from Human Feedback) on any generation model in Hugging Face's transformers library (bloomz-176B/bloom/gpt/bart/T5/MetaICL)
-
tabnine-sublime
Tabnine Autocomplete AI: JavaScript, Python, TypeScript, PHP, C/C++, HTML/CSS, Go, Java, Ruby, C#, Rust, SQL, Bash, Kotlin, Julia, Lua, OCaml, Perl, Haskell, React
Project mention: 5 Free AI Coding Copilots to Help You Fly Out of the Dev Blackhole | dev.to | 2024-06-18
-
Project mention: Open source – Unsupervised captioning getting closer to supervised captioning | news.ycombinator.com | 2024-04-20
-
Namekrea is an open-source AI domain name generator powered by GPT-2, helping users find creative and available domain names by intelligently combining keywords for various projects.
-
AdaVAE
[Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling PyTorch Implementation
-
Extracting-Training-Data-from-Large-Langauge-Models
A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020
Python gpt-2 discussion
Python gpt-2 related posts
-
Visually Multilingual: Introducing mcdse-2b
-
Quick tip: Using picoGPT in the SingleStore portal
-
A look at Apple's technical approach to AI including core model performance etc.
-
Open source – Unsupervised captioning getting closer to supervised captioning
-
DECT NR+: A technical dive into non-cellular 5G
-
Training LLMs Taking Too Much Time? Technique you need to know to train it faster
-
Ask HN: How do you name software?
-
A note from our sponsor - CodeRabbit
coderabbit.ai | 6 Feb 2025
Index
What are some of the best open-source gpt-2 projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | RWKV-LM | 13,086 |
2 | LoRA | 11,193 |
3 | GPT2-Chinese | 7,506 |
4 | awesome-pretrained-chinese-nlp-models | 5,060 |
5 | picoGPT | 3,302 |
6 | xTuring | 2,627 |
7 | DialoGPT | 2,365 |
8 | Code-LMs | 1,805 |
9 | transfer-learning-conv-ai | 1,742 |
10 | TencentPretrain | 1,056 |
11 | this-word-does-not-exist | 1,019 |
12 | TextRL | 549 |
13 | DialogRPT | 337 |
14 | MAGIC | 255 |
15 | tabnine-sublime | 197 |
16 | CapDec | 189 |
17 | transformer-lm | 164 |
18 | pistoBot | 143 |
19 | openai-detector | 122 |
20 | nanoChatGPT | 62 |
21 | namekrea | 49 |
22 | AdaVAE | 34 |
23 | Extracting-Training-Data-from-Large-Langauge-Models | 33 |