gpt2bot
Transformers-Tutorials
gpt2bot | Transformers-Tutorials | |
---|---|---|
1 | 7 | |
425 | 7,875 | |
- | - | |
0.0 | 8.4 | |
2 months ago | 6 days ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gpt2bot
-
Anyone knows how to do GPT3 persona?
Anyone knows how to do GPT3 persona? I am seeking a solution to apply GPT3 persona on DialogRPT (there is a bot based on it : https://github.com/polakowo/gpt2bot). I am willing to pay for this solution, thanks!
Transformers-Tutorials
-
AI enthusiasm #6 - Finetune any LLM you want๐ก
Most of this tutorial is based on Hugging Face course about Transformers and on Niels Rogge's Transformers tutorials: make sure to check their work and give them a star on GitHub, if you please โค๏ธ
- FLaNK Stack Weekly for 07August2023
- How to annotate compound words to build NER models?
-
[discussion] Anybody Working with VITMAE?
I'm pretraining on 850K grayscale spectrograms of birdsongs. I'm on epoch 400 out of 800 and the loss has declined from about 1.2 to 0.7. I don't really have a sense of what is "good enough" and I guess the only way I can judge is by looking at the reconstruction. I'm doing that using this notebook as a guide and right now it's doing quite badly.
-
[D] NLP has HuggingFace, what does Computer Vision have?
More tutorials can be found at https://github.com/NielsRogge/Transformers-Tutorials.
-
[Discussion] Information Extraction with LayoutLMv2
Ive been looking for an off the shelf encoder-decoder document understanding model for key information extraction. I found a great Huggingface implementation with concise notebook examples. However, the token classification model outputs a list of token labels corresponding bounding boxes for the token, but, not the text contained within the labeled bounding boxes themselves. Am I missing something? LayoutLMv2 describes itself as being capable of information extraction but without extracting the text I feel like it's fallen short of that ambition.
-
[Project] Deepmind's Perceiver IO available through Hugging Face
Example Notebooks
What are some alternatives?
RecipeGPT-exp - RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System (TheWebConf'2020; WWW'20)
nn - ๐งโ๐ซ 60 Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ๐ฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐ง
finetuned-qlora-falcon7b-medical - Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
gorilla-cli - LLMs for your CLI
tensorflow-nanoGPT - Example how to train GPT-2 (XLA + AMP), export to SavedModel and serve with Tensorflow Serving
pytorch-image-models - PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
mgpt - Multilingual Generative Pretrained Model
notebooks - Notebooks using the Hugging Face libraries ๐ค
elon-bot - Discord AI bot capable of chatting and moderating, trained on conversation transcripts of Elon Musk
adaptnlp - An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.
OpenBuddy - Open Multilingual Chatbot for Everyone
ToolBench - [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.