Behavior-Sequence-Transformer-Pytorch vs Basic-UI-for-GPT-J-6B-with-low-vram

Behavior-Sequence-Transformer-Pytorch

This is a pytorch implementation for the BST model from Alibaba https://arxiv.org/pdf/1905.06874.pdf (by jiwidi)

Basic-UI-for-GPT-J-6B-with-low-vram

A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model loading takes 12gb free ram. (by arrmansa)

gpt-neo Gpt Transformers

Source Code

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

Behavior-Sequence-Transformer-Pytorch		Basic-UI-for-GPT-J-6B-with-low-vram
	Project
1	Mentions	4
129	Stars	113
-	Growth	-
0.0	Activity	0.0
almost 2 years ago	Latest Commit	over 2 years ago
Jupyter Notebook	Language	Jupyter Notebook
MIT License	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Behavior-Sequence-Transformer-Pytorch

Posts with mentions or reviews of Behavior-Sequence-Transformer-Pytorch. We have used some of these posts to build our list of alternatives and similar projects.

AI Creating recommendations for MovieLens with Alibaba transformer model!
1 project | /r/learnmachinelearning | 17 Jul 2021

Basic-UI-for-GPT-J-6B-with-low-vram

Posts with mentions or reviews of Basic-UI-for-GPT-J-6B-with-low-vram. We have used some of these posts to build our list of alternatives and similar projects.

How to run this service with a local GPU?
1 project | /r/PygmalionAI | 27 Jan 2023

You need a lot of VRAM to run the AI models, scaling somewhat with the amount of parameters a model uses. The most advanced model Pygmalion has is 6 billion parameters, which requires a minimum of 16GB of VRAM to run locally at decent speeds. There are methods of running 6b locally on low VRAM machines as listed here: https://github.com/arrmansa/Basic-UI-for-GPT-J-6B-with-low-vram but even then, the generations would be excruciatingly slow, and the lowest VRAM card used with this method has 6GB of VRAM.
Tesla M40 and GPT-J-6B
1 project | /r/KoboldAI | 8 Aug 2021

While waiting however I came across https://github.com/arrmansa/Basic-UI-for-GPT-J-6B-with-low-vram which allows you to use some of system memory to run the model. I was able to get a version working with 2.7B on my 2060 6GB with KoboldAI. The github above has an error that prevents it from working (https://github.com/arrmansa/Basic-UI-for-GPT-J-6B-with-low-vram/issues/1), but other than that it works.
How is any of this even possible?
1 project | /r/GPT3 | 21 Jul 2021

Just to add to this, there is a low VRAM version of GPT-J here (suggest 16GB RAM + 8GB GPU).
GPT-J 6B locally on my computer
1 project | /r/KoboldAI | 25 Jun 2021

I found this yesterday, is it somehow possible to use this with KoboldAI to run GPT-J on weaker graphics cards?

What are some alternatives?

When comparing Behavior-Sequence-Transformer-Pytorch and Basic-UI-for-GPT-J-6B-with-low-vram you can also consider the following projects:

Stock-Prediction-Models - Gathers machine learning and deep learning models for Stock forecasting including trading bots and simulations

gpt-neo_dungeon - Colab notebooks to run a basic AI Dungeon clone using gpt-neo-2.7B

pytorch-seq2seq - Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

adaptnlp - An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.

nn - 🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

clip-italian - CLIP (Contrastive Language–Image Pre-training) for Italian

pytorch-sentiment-analysis - Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

pytorch-generative - Easy generative modeling in PyTorch.

Eleya - Artificial Intelligence That Generate Novel Biomedical Text

Behavior-Sequence-Transformer-Pytorch vs Stock-Prediction-Models Basic-UI-for-GPT-J-6B-with-low-vram vs gpt-neo_dungeon Behavior-Sequence-Transformer-Pytorch vs pytorch-seq2seq Basic-UI-for-GPT-J-6B-with-low-vram vs adaptnlp Behavior-Sequence-Transformer-Pytorch vs nn Basic-UI-for-GPT-J-6B-with-low-vram vs clip-italian Behavior-Sequence-Transformer-Pytorch vs pytorch-sentiment-analysis Basic-UI-for-GPT-J-6B-with-low-vram vs pytorch-sentiment-analysis Basic-UI-for-GPT-J-6B-with-low-vram vs nn Basic-UI-for-GPT-J-6B-with-low-vram vs pytorch-generative Basic-UI-for-GPT-J-6B-with-low-vram vs Eleya

Compare Behavior-Sequence-Transformer-Pytorch vs Basic-UI-for-GPT-J-6B-with-low-vram and see what are their differences.

Behavior-Sequence-Transformer-Pytorch

Basic-UI-for-GPT-J-6B-with-low-vram

Behavior-Sequence-Transformer-Pytorch

Basic-UI-for-GPT-J-6B-with-low-vram

What are some alternatives?