ik_llama.cpp

llama.cpp fork with additional SOTA quants and improved performance (by ikawrakow)

Ik_llama.cpp Alternatives

Similar projects and alternatives to ik_llama.cpp

  1. llama.cpp

    LLM inference in C/C++

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. ollama

    Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

  4. nano-vllm

    Nano vLLM

  5. tt-metal

    :metal: TT-NN operator library, and TT-Metalium low level kernel programming model.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better ik_llama.cpp alternative or higher similarity.

ik_llama.cpp discussion

Log in or Post with

ik_llama.cpp reviews and mentions

Posts with mentions or reviews of ik_llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-06-24.

Stats

Basic ik_llama.cpp repo stats
3
640
9.7
7 days ago

ikawrakow/ik_llama.cpp is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of ik_llama.cpp is C++.


Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that C++ is
the 7th most popular programming language
based on number of references?