DialoGPT
educational-transformer
 | DialoGPT | educational-transformer |
---|---|---|
Mentions | 7 | 1 |
Stars | 2,315 | 2 |
Growth | 1.0% | - |
Activity | 0.0 | 3.8 |
Last commit | over 1 year ago | 7 months ago |
Language | Python | Python |
License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
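The definitions above give the idea behind the metrics but not the exact formulas. As a minimal sketch only, the following shows one way such numbers could be computed. The function names, the exponential decay, and the 30-day half-life are all assumptions made for illustration, not the site's actual method.

```python
from typing import Iterable, Optional, Sequence


def growth_percent(stars_last_month: int, stars_now: int) -> Optional[float]:
    """Month-over-month star growth in percent (hypothetical helper).

    Returns None when there is no baseline to compare against,
    which a table might display as "-".
    """
    if stars_last_month <= 0:
        return None
    return 100.0 * (stars_now - stars_last_month) / stars_last_month


def weighted_commit_score(commit_ages_days: Iterable[float],
                          half_life_days: float = 30.0) -> float:
    """Sum of per-commit weights, where recent commits count more.

    Assumption: a commit's weight halves every `half_life_days`.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_index(score: float, all_scores: Sequence[float]) -> float:
    """Map a raw score to a 0-10 scale by percentile rank.

    A value of 9.0 means 90% of tracked projects score lower,
    i.e. the project is in the top 10%.
    """
    below = sum(1 for s in all_scores if s < score)
    return round(10.0 * below / len(all_scores), 1)


if __name__ == "__main__":
    # Illustrative numbers only, not taken from any real project.
    print(growth_percent(200, 202))
    print(weighted_commit_score([0, 30]))
    print(activity_index(9, list(range(10))))
```

Ranking by percentile rather than by raw score keeps the index relative, which matches the description of activity as a number measured against all tracked projects.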
DialoGPT
- Just a thought
- Mycroft AI companion
  They recommend using https://github.com/microsoft/DialoGPT now, btw. It appears to be current and maintained, and it is built on a transformer (vs. RNN only). It might be better long-term to migrate.
- DialoGPT finetuned on my own message data
- I made a Python tool to help you know what to say!
  I learned about GPT-3 and its strength as a generative model but couldn't access it yet (can't afford the API). Thankfully, I found DialoGPT, a GPT-2 based pre-trained model that was trained on Reddit.
- AIstiny: The ultimate debate bot. Phase 0: Viability and call to the community for help
  The most readily available technology for creating the type of chatbot we want (given the type of data we have) is the GPT-2 based DialoGPT. There are many papers and examples. If you have never heard of GPT-2 before, you may have heard of AI Dungeon: although it currently runs on GPT-3, the initial versions were based on GPT-2.
- Telegram Client+Bot that use Artificial Intelligence to waste scammers time
  No, it uses a chat AI based on GPT-2 called DialoGPT.
- [P] H5Records : Store large datasets in one single files with index access
educational-transformer
What are some alternatives?
pistoBot - Create an AI that chats like you
awesome-pretrained-chinese-nlp-models - Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
GPT2-Chinese - Chinese version of GPT2 training code, using BERT tokenizer.
gpt-neo - An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
DialogRPT - EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"
RWKV-LM - RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
GODEL - Large-scale pretrained models for goal-directed dialog
LoRA - Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
forte - Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/
namekrea - NameKrea is an AI Domain Name Generator which uses GPT-2
conversation-helper - GUI implementation of a Transformer chatbot. Suggests amicable responses to messages from friends.
cakechat - CakeChat: Emotional Generative Dialog System