llama.cpp

LLM inference in C/C++ (by ggml-org)

Llama.cpp Alternatives

Similar projects and alternatives to llama.cpp

  1. textgen

    887 llama.cpp VS textgen

    Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. ollama

    Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

  4. zed

    288 llama.cpp VS zed

    Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

  5. transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

  6. whisper.cpp

    Port of OpenAI's Whisper model in C/C++

  7. koboldcpp

    Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

  8. llama

    190 llama.cpp VS llama

    Inference code for Llama models

  9. gpt4all

    GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

  10. stanford_alpaca

    Code and documentation to train Stanford's Alpaca models, and generate the data.

  11. alpaca-lora

    107 llama.cpp VS alpaca-lora

    Instruct-tune LLaMA on consumer hardware

  12. mlc-llm

    90 llama.cpp VS mlc-llm

    Universal LLM Deployment Engine with ML Compilation

  13. alpaca.cpp

    Discontinued Locally run an Instruction-Tuned Chat-Style LLM

  14. FastChat

    86 llama.cpp VS FastChat

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

  15. ggml

    76 llama.cpp VS ggml

    Tensor library for machine learning

  16. GPTQ-for-LLaMa

    4 bits quantization of LLaMA using GPTQ

  17. llamafile

    Distribute and run LLMs with a single file.

  18. exllama

    66 llama.cpp VS exllama

    A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

  19. llama-cpp-python

    Python bindings for llama.cpp

  20. outlines

    51 llama.cpp VS outlines

    Structured Outputs

  21. llm

    41 llama.cpp VS llm

    Discontinued [Unmaintained, see README] An ecosystem of Rust libraries for working with large language models (by rustformers)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better llama.cpp alternative or higher similarity.

llama.cpp discussion

Log in or Post with

llama.cpp reviews and mentions

Posts with mentions or reviews of llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2026-06-12.

Stats

Basic llama.cpp repo stats
1033
115,929
10.0
4 days ago

ggml-org/llama.cpp is an open source project licensed under MIT License which is an OSI approved license.

llama.cpp is marked as "self-hosted". This means that it can be used as a standalone application on its own.

The primary programming language of llama.cpp is C++.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you know that C++ is
the 7th most popular programming language
based on number of references?