SaaSHub helps you find the best software and product alternatives Learn more →
Koboldcpp Alternatives
Similar projects and alternatives to koboldcpp
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
textgen
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
-
ollama
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
-
-
-
-
-
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
-
-
SillyTavern
Discontinued LLM Frontend for Power Users. [Moved to: https://github.com/SillyTavern/SillyTavern] (by Cohee1207)
-
-
exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
-
-
simple-proxy-for-tavern
Discontinued [GET https://api.github.com/repos/anon998/simple-proxy-for-tavern: 404 - Not Found // See: https://docs.github.com/rest/repos/repos#get-a-repository]
-
Local-LLM-Comparison-Colab-UI
Compare the performance of different LLM that can be deployed locally on consumer hardware. Run yourself with Colab WebUI.
-
-
TavernAI
Discontinued TavernAI for nerds [Moved to: https://github.com/Cohee1207/SillyTavern] (by SillyLossy)
-
koboldcpp discussion
koboldcpp reviews and mentions
-
Best Free AI Chatbots Without Login (over TOR and Anonymous)
https://github.com/LostRuins/koboldcpp Download models at HuggingFace and run them locally. No logins, no spying, no hidden data harvesting.
-
LM Studio is now an MCP Host
Oh, that horrible Electron UI. Under Windows it pegs a core on my CPU at all times!
If you're just working as a single user via the OpenAI protocol, you might want to consider koboldcpp. It bundles a GUI launcher, then starts in text-only mode. You can also tell it to just run a saved configuration, bypassing the GUI; I've successfully run it as a system service on Windows using nssm.
https://github.com/LostRuins/koboldcpp/releases
Though there are a lot of roleplay-centric gimmicks in its feature set, its context-shifting feature is singular. It caches the intermediate state used by your last query, extending it to build the next one. As a result you save on generation time with large contexts, and also any conversation that has been pushed out of the context window still indirectly influences the current exchange.
- LostRuins/koboldcpp: Run GGUF models easily with a KoboldAI UI
-
Hosting HuggingFace Models with KoboldCpp and RunPod
KoboldCpp is a popular text generation software for GGML and GGUF models. It also comes with an OpenAI-compatible API endpoint when serving a model, which makes it easy to use with LibreChat and other software that can connect to OpenAI-compatible endpoints.
- AMD Inference
- Any Online Communities on Local/Home AI?
- Koboldcpp-1.62.1 adds support for Command-R+
- Show HN: I made an app to use local AI as daily driver
-
Easiest way to show my model to my mom?
FYI this is the easiest way to host on the horde: https://github.com/LostRuins/koboldcpp
- IT Veteran... why am I struggling with all of this?
-
A note from our sponsor - SaaSHub
www.saashub.com | 11 Jun 2026
Stats
LostRuins/koboldcpp is an open source project licensed under GNU Affero General Public License v3.0 which is an OSI approved license.
The primary programming language of koboldcpp is C++.