SillyTavern vs annoy_ltm
| | SillyTavern | annoy_ltm |
|---|---|---|
| Mentions | 75 | 3 |
| Stars | 677 | 32 |
| Growth | - | - |
| Activity | 10.0 | 6.5 |
| Last commit | 12 months ago | 10 months ago |
| Language | JavaScript | Python |
| License | GNU Affero General Public License v3.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
SillyTavern
-
Help😢
Exit Termux, then reopen it and run the following (collected into a single runnable block below):
1. apt update
2. apt upgrade
3. git clone https://github.com/Cohee1207/SillyTavern
4. cd SillyTavern
5. pkg install nodejs
6. npm install
7. node server.js
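For convenience, here is that sequence as one pasteable Termux snippet. This is only a sketch of the steps above, not a verified script; it assumes a stock Termux where git and nodejs are installed via pkg (the Android guide further down installs git the same way).

```bash
# Sketch of the Termux install steps described above.
# Assumption: git is not preinstalled, so it is added alongside nodejs via pkg.
apt update
apt upgrade
pkg install git nodejs
git clone https://github.com/Cohee1207/SillyTavern
cd SillyTavern
npm install
node server.js   # starts the SillyTavern server
```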
-
Oobabooga and llama.cpp: in longer conversations, answers take forever...
If you want the best roleplaying experience, I can only recommend SillyTavern with SillyTavern/SillyTavern-extras. The extras include summarization and ChromaDB, both helping to get longer and more coherent chats.
-
koboldcpp-1.33 Ultimate Edition released!
Really? Then we definitely have different experiences (or different ways to interact) with Guanaco. It's been the most unrestricted model I've tried, and I've tried them all, but I'm using SillyTavern and the simple-proxy-for-tavern, which, combined with a little prompting, liberates basically any model.
-
The best 13B model for roleplay?
Why reinvent the wheel? Just use SillyTavern, ideally with the simple-proxy-for-tavern. That does it all, and more.
-
airoboros gpt4 v1.2
I tested this today in an hours-long direct roleplay comparison between q3_K_M quants of TheBloke/airoboros-65B-gpt4-1.2-GGML and TheBloke/guanaco-65B-GGML, using koboldcpp as backend together with simple-proxy-for-tavern and SillyTavern as frontend.
-
What are you using for RP?
I'm using SillyTavern frontend and simple-proxy-for-tavern with koboldcpp backend.
-
KoboldCPP Updated to Support K-Quants, new bonus CUDA build.
I'm using SillyTavern frontend and simple-proxy-for-tavern with koboldcpp. Not sure which of these has solved the prompt-reprocessing problem, but I no longer have these slowdowns.
-
What are your favorite LLMs?
WizardLM 30B V1.0 is not only smarter and better at following instructions than the others, it's even uncensored when used with an uncensoring character card (I use SillyTavern as my GUI/frontend) - more so than any other model I tested. Probably because it follows instructions so well, it roleplays an uncensored character properly (and didn't break character or go "as an AI" even once during my tests).
-
Potato's brain guide to installing and reopening SillyTavern for Mac
curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.39.3/install.sh | bash
export NVM_DIR="$([ -z "${XDG_CONFIG_HOME-}" ] && printf %s "${HOME}/.nvm" || printf %s "${XDG_CONFIG_HOME}/nvm")"
[ -s "$NVM_DIR/nvm.sh" ] && \. "$NVM_DIR/nvm.sh"
nvm install node
git clone -b dev https://github.com/Cohee1207/SillyTavern && cd SillyTavern
npm i && node server.js
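The snippet above covers installation only; since the title also promises "reopening", relaunching later should only need the last two steps again. A minimal sketch, assuming the repository was cloned into your home directory:

```bash
# Reopen an existing install (assumption: the repo was cloned to ~/SillyTavern).
cd ~/SillyTavern
node server.js
```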
-
I've found a solution to Poe API error
For Android (Termux) users (the same steps are collected into one pasteable block below):
1. apt update
2. apt upgrade
3. Type "y" to everything and hit enter
4. pkg install git
5. git clone -b dev https://github.com/Cohee1207/SillyTavern
6. cd SillyTavern
7. pkg install nodejs
8. npm install
9. bash start.sh
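The same steps as a single block, as a sketch rather than a verified script: the -y flag stands in for typing "y" at the prompts, and the two pkg installs are combined into one.

```bash
# Android/Termux install, following the numbered steps above.
apt update
apt upgrade -y                 # -y pre-confirms the prompts instead of typing "y"
pkg install -y git nodejs
git clone -b dev https://github.com/Cohee1207/SillyTavern
cd SillyTavern
npm install
bash start.sh                  # launches the SillyTavern server
```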
annoy_ltm
-
Looking for the long-term memory extension.
-
I created a memory system to let your chatbots remember past interactions in a human-like way.
I know very little about the difference, but I use GPTQ models, so it's possible. There is one issue on the repo about something similar: https://github.com/YenRaven/annoy_ltm/issues/7. If this is what you are seeing, then I think I know the issue. The embeddings output has a dimensionality that I treated as a magic number for most of development, but it turns out the models I use have a config that exposes exactly the number I needed. It's possible GGML models have no such config, or a different structure. I'm planning to add a setting to the extension that lets you override this value, which hopefully will fix the issue.
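As context for the config the author mentions: Hugging Face-format models (which GPTQ quants normally are) ship a config.json that usually exposes the embedding width, typically under "hidden_size", while single-file GGML models carry no such JSON, which would fit the explanation above. Whether annoy_ltm reads exactly this field is an assumption; a quick way to peek at the value (the path is a placeholder):

```bash
# Print the embedding width from a Hugging Face-format model directory.
# "hidden_size" is the usual field name in config.json (an assumption about
# which value annoy_ltm needs); GGML .bin files have no config.json at all.
grep '"hidden_size"' /path/to/model/config.json
```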
What are some alternatives?
koboldcpp - A simple one-file way to run various GGML and GGUF models with KoboldAI's UI
long_term_memory - A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
TavernAI - TavernAI for nerds [Moved to: https://github.com/Cohee1207/SillyTavern]
SillyTavern - LLM Frontend for Power Users.
langflow - ⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
character-editor - Create, edit and convert AI character files for CharacterAI, Pygmalion, Text Generation, KoboldAI and TavernAI
superbig - A prompt/context management system
simple-proxy-for-tavern
SillyTavern-Extras - Extensions API for SillyTavern.
ChatRWKV - ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
private-gpt - Interact with your documents using the power of GPT, 100% privately, no data leaks