go-llama.cpp vs llama-cpp-python

go-llama.cpp

LLama.cpp golang bindings (by go-skynet)

Suggest topics

Source Code

Suggest alternative

Edit details

llama-cpp-python

Python bindings for llama.cpp (by abetlen)

Suggest topics

Source Code

llama-cpp-python.readthedocs.io

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

go-llama.cpp		llama-cpp-python
	Project
4	Mentions	56
585	Stars	6,725
9.6%	Growth	-
7.9	Activity	9.8
5 days ago	Latest Commit	6 days ago
C++	Language	Python
MIT License	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

go-llama.cpp

Posts with mentions or reviews of go-llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-19.

Lokale LLM: Gibt es bereits welche für <= 4 GB vRAM?
1 project | /r/KI_Welt | 10 Jul 2023
LocalAI v1.19.0 - CUDA GPU support!
2 projects | /r/selfhosted | 19 Jun 2023

Full CUDA GPU offload support ( PR by mudler. Thanks to chnyda for handing over the GPU access, and lu-zero to help in debugging )
Could I get a suggestion for a simple HTTP API with no GUI for llama.cpp?
8 projects | /r/LocalLLaMA | 16 May 2023

Go: go-skynet/go-llama.cpp
Redirecting Model Outputs from llama.cpp to a TXT File for Easier Tracking of Results?
1 project | /r/LocalLLaMA | 2 May 2023

I've had great success using go-llama.cpp to wrap llama in a much-friendlier language. The install process is a bit clunky- go does not like compiling submodules, so you need to use a replace within the go.mod file to point towards a local copy of go-llama.cpp that you've already compiled manually.

llama-cpp-python

Posts with mentions or reviews of llama-cpp-python. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-28.

Ollama v0.1.33 with Llama 3, Phi 3, and Qwen 110B
11 projects | news.ycombinator.com | 28 Apr 2024

There's a Python binding for llama.cpp which is actively maintained and has worked well for me: https://github.com/abetlen/llama-cpp-python
FLaNK AI for 11 March 2024
46 projects | dev.to | 11 Mar 2024
OpenAI: Memory and New Controls for ChatGPT
4 projects | news.ycombinator.com | 13 Feb 2024

I'll share the core bit that took a while to figure out the right format, my main script is a hot mess using embeddings with SentenceTransformer, so I won't share that yet. E.g: last night I did a PR for llama-cpp-python that shows how Phi might be used with JSON only for the author to write almost exactly the same code at pretty much the same time. https://github.com/abetlen/llama-cpp-python/pull/1184
TinyLlama LLM: A Step-by-Step Guide to Implementing the 1.1B Model on Google Colab
2 projects | dev.to | 6 Jan 2024

Python Bindings for llama.cpp
Mistral-8x7B-Chat
4 projects | news.ycombinator.com | 10 Dec 2023
Running Mistral LLM on Apple Silicon Using Apple's MLX Framework Is Much Faster
2 projects | news.ycombinator.com | 6 Dec 2023

If the model could be made to work with llama.cpp, then https://github.com/abetlen/llama-cpp-python might be more compact. llama.cpp only supports a limited list of model types though.
Run ChatGPT-like LLMs on your laptop in 3 lines of code
9 projects | news.ycombinator.com | 6 Sep 2023
Code Llama, a state-of-the-art large language model for coding
4 projects | news.ycombinator.com | 24 Aug 2023

https://github.com/abetlen/llama-cpp-python has a web server mode that replicates openai's API iirc and the readme shows it has docker builds already.
Meta: Code Llama, an AI Tool for Coding
18 projects | news.ycombinator.com | 24 Aug 2023

LocalAI https://localai.io/ and LMStudio https://lmstudio.ai/ both have fairly complete OpenAI compatibility layers. llama-cpp-python has a FastAPI server as well: https://github.com/abetlen/llama-cpp-python/blob/main/llama_... (as of this moment it hasn't merged GGUF update yet though)
First steps with llama
2 projects | dev.to | 31 Jul 2023

I went with Python, llama-cpp-python, since my goal is just to get a small project up and running locally.

Compare go-llama.cpp vs llama-cpp-python and see what are their differences.

go-llama.cpp

llama-cpp-python

go-llama.cpp

llama-cpp-python