| | dockerLLM | koboldcpp |
|---|---|---|
| Mentions | 5 | 180 |
| Stars | 286 | 4,133 |
| Growth | 0.7% | - |
| Activity | 7.3 | 10.0 |
| Last commit | 3 months ago | 6 days ago |
| Language | Shell | C++ |
| License | MIT License | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dockerLLM
- Local VS Cloud?
  I use Runpod now. TheBloke provides some templates that make it easy to start: https://github.com/TheBlokeAI/dockerLLM/.
- Free LLM api
  or use TheBloke Local LLMs One-Click UI template on runpod https://github.com/TheBlokeAI/dockerLLM/blob/main/README_Runpod_LocalLLMsUIandAPI.md
- oobabooga Update broke loading u/The-Bloke huggingface models?
- "Samantha-33B-SuperHOT-8K-GPTQ" now that's a great name for a true model.
  The one thing I have published is my Docker files for producing my two Runpod templates, which let people try GGML and GPTQ models on Runpod pods with full GPU acceleration (ExLlama and AutoGPTQ). They can be found at https://github.com/TheBlokeAI/dockerLLM/.
- OpenLLaMA 13B Released
  https://www.runpod.io/console/templates
  This is the readme for the one I mentioned: https://github.com/TheBlokeAI/dockerLLM/blob/main/README_Run...
  > can I use Colab/Huggingface GPUs?
  You use these templates on the Runpod platform itself. There's no free-tier equivalent like you have with Colab/HF, but currently you can rent an RTX 4090 for $0.69/hr, so it's pretty affordable.
koboldcpp
- Any Online Communities on Local/Home AI?
- Koboldcpp-1.62.1 adds support for Command-R+
- Show HN: I made an app to use local AI as daily driver
- Easiest way to show my model to my mom?
  FYI this is the easiest way to host on the horde: https://github.com/LostRuins/koboldcpp
- IT Veteran... why am I struggling with all of this?
- What do you use to run your models?
- ByteDance AI researcher suggests that open source model more powerful than Gemini to be released soon
- i need some help guys
- [Guide] How install KoboldAI in Android via Termux (Update 04-12-2023)
  For more information on Koboldcpp, see this guide: https://github.com/LostRuins/koboldcpp/wiki
- SillyTavern 1.10.10 has been released
  Out of curiosity, is there a specific reason for this? The most popular fork, KoboldCpp, is in active development, was the first to adopt the Min P sampler, and even distinguishes itself with the context shift feature. Just wondering what this means for the future. Thanks!
What are some alternatives?
llama.cpp - LLM inference in C/C++
KoboldAI
lm-evaluation-harness - A framework for few-shot evaluation of language models.
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
TavernAI - Atmospheric adventure chat for AI language models (KoboldAI, NovelAI, Pygmalion, OpenAI chatgpt, gpt-4)
alpaca_lora_4bit
KoboldAI - KoboldAI is generative AI software optimized for fictional use, but capable of much more!
open_llama - OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
ChatRWKV - ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
SillyTavern - LLM Frontend for Power Users. [Moved to: https://github.com/SillyTavern/SillyTavern]
exllama - A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.