Python gpt-2

Open-source Python projects categorized as gpt-2

Top 23 Python gpt-2 Projects

  1. RWKV-LM

    RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). The current version is RWKV-7 "Goose". It combines the best of RNNs and transformers: great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

  2. LoRA

    Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

    Project mention: Visually Multilingual: Introducing mcdse-2b | dev.to | 2024-10-27

    mcdse-2b is trained from MrLight/dse-qwen2-2b-mrl-v1 using low-rank adapters (LoRA) on a multilingual corpus of documents. I have trained it on 8xRTX3090 using the DSE approach with the following parameters:

  3. GPT2-Chinese

    Chinese version of GPT2 training code, using BERT tokenizer.

  4. awesome-pretrained-chinese-nlp-models

    Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

  5. picoGPT

    An unnecessarily tiny implementation of GPT-2 in NumPy.

    Project mention: Quick tip: Using picoGPT in the SingleStore portal | dev.to | 2024-06-23

    picoGPT is a simplified and minimal implementation of the GPT model. It demonstrates the core principles of the GPT architecture without the requirement for a full-scale implementation. Written in Python and consisting of a small amount of code, picoGPT doesn't implement many of the optimisations and enhancements found in comprehensive implementations. In this short article, we'll convert the original Python code to a Jupyter notebook and test it in the SingleStore portal.

  6. xTuring

    Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

  7. DialoGPT

    Large-scale pretraining for dialogue

  8. Code-LMs

    Guide to using pre-trained large language models of source code

  9. transfer-learning-conv-ai

    🦄 State-of-the-Art Conversational AI with Transfer Learning

  10. TencentPretrain

    Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

  11. this-word-does-not-exist

    This Word Does Not Exist

    Project mention: Ask HN: How do you name software? | news.ycombinator.com | 2024-02-10

  12. TextRL

    Implementation of ChatGPT RLHF (Reinforcement Learning from Human Feedback) on any generation model in Hugging Face's transformers (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

  13. DialogRPT

    EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"

  14. MAGIC

    Language Models Can See: Plugging Visual Controls in Text Generation (by yxuansu)

  15. tabnine-sublime

    Tabnine Autocomplete AI: JavaScript, Python, TypeScript, PHP, C/C++, HTML/CSS, Go, Java, Ruby, C#, Rust, SQL, Bash, Kotlin, Julia, Lua, OCaml, Perl, Haskell, React

    Project mention: 5 Free AI Coding Copilots to Help You Fly Out of the Dev Blackhole | dev.to | 2024-06-18

    Sublime Text

  16. CapDec

    CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)

    Project mention: Open source – Unsupervised captioning getting closer to supervised captioning | news.ycombinator.com | 2024-04-20

  17. transformer-lm

    Transformer language model (GPT-2) with sentencepiece tokenizer

  18. pistoBot

    Create an AI that chats like you

  19. openai-detector

    AI classifier for indicating AI-written text

  20. nanoChatGPT

    nanoGPT turned into a chat model (by VatsaDev)

  21. namekrea

    NameKrea is an AI Domain Name Generator which uses GPT-2

    Project mention: The best domain name generators on the web | dev.to | 2024-05-19

    Namekrea is an open-source AI domain name generator powered by GPT-2, helping users find creative and available domain names by intelligently combining keywords for various projects.

  22. AdaVAE

    [Preprint] PyTorch implementation of "AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling"

  23. Extracting-Training-Data-from-Large-Langauge-Models

    A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python gpt-2 related posts

  • Visually Multilingual: Introducing mcdse-2b

    2 projects | dev.to | 27 Oct 2024
  • Quick tip: Using picoGPT in the SingleStore portal

    2 projects | dev.to | 23 Jun 2024
  • A look at Apple's technical approach to AI including core model performance etc.

    2 projects | news.ycombinator.com | 14 Jun 2024
  • Open source – Unsupervised captioning getting closer to supervised captioning

    1 project | news.ycombinator.com | 20 Apr 2024
  • DECT NR+: A technical dive into non-cellular 5G

    1 project | news.ycombinator.com | 2 Apr 2024
  • Training LLMs Taking Too Much Time? Technique you need to know to train it faster

    1 project | dev.to | 3 Mar 2024
  • Ask HN: How do you name software?

    1 project | news.ycombinator.com | 10 Feb 2024

Index

What are some of the best open-source gpt-2 projects in Python? This list will help you:

#   Project                                               Stars
1   RWKV-LM                                               13,086
2   LoRA                                                  11,193
3   GPT2-Chinese                                          7,506
4   awesome-pretrained-chinese-nlp-models                 5,060
5   picoGPT                                               3,302
6   xTuring                                               2,627
7   DialoGPT                                              2,365
8   Code-LMs                                              1,805
9   transfer-learning-conv-ai                             1,742
10  TencentPretrain                                       1,056
11  this-word-does-not-exist                              1,019
12  TextRL                                                549
13  DialogRPT                                             337
14  MAGIC                                                 255
15  tabnine-sublime                                       197
16  CapDec                                                189
17  transformer-lm                                        164
18  pistoBot                                              143
19  openai-detector                                       122
20  nanoChatGPT                                           62
21  namekrea                                              49
22  AdaVAE                                                34
23  Extracting-Training-Data-from-Large-Langauge-Models   33
