llama-cpp-python VS llama.cpp

Compare llama-cpp-python and llama.cpp to see how they differ.

                llama-cpp-python   llama.cpp
Mentions        55                 775
Stars           6,475              57,463
Growth          -                  -
Activity        9.8                10.0
Last commit     6 days ago         1 day ago
Language        Python             C++
License         MIT License        MIT License
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

llama-cpp-python

Posts with mentions or reviews of llama-cpp-python. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-28.

llama.cpp

Posts with mentions or reviews of llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-04.

What are some alternatives?

When comparing llama-cpp-python and llama.cpp you can also consider the following projects:

LocalAI - The free, open-source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. It can generate text, audio, video and images, and also supports voice cloning.

ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models.

intel-extension-for-pytorch - A Python package that extends the official PyTorch to improve performance on Intel platforms

gpt4all - Run open-source LLMs anywhere

text-generation-inference - Large Language Model Text Generation Inference

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

mlc-llm - Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

GPTQ-for-LLaMa - 4-bit quantization of LLaMA using GPTQ

FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

ggml - Tensor library for machine learning

alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM