gpt-llama.cpp vs llama.cpp

gpt-llama.cpp

A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI. (by keldenl)

Suggest topics

Source Code

Suggest alternative

Edit details

llama.cpp

LLM inference in C/C++ (by ggerganov)

llama llm

Source Code

Suggest alternative

Edit details

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App

With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

surveyjs.io

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

gpt-llama.cpp		llama.cpp
	Project
12	Mentions	776
587	Stars	57,463
-	Growth	-
8.2	Activity	10.0
11 months ago	Latest Commit	5 days ago
JavaScript	Language	C++
MIT License	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

gpt-llama.cpp

Posts with mentions or reviews of gpt-llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-22.

Attempt to run Llama on a remote server with chatbot-ui
2 projects | /r/LocalLLaMA | 22 Jun 2023

hi! I really like the solution https://github.com/keldenl/gpt-llama.cpp which helps to deploy https://github.com/mckaywrigley/chatbot-ui on the local model. I am running this together with Wizard7b or 13b locally and it works fine, but when I tried to upload to a remote server I met an error.
Introducing Basaran: self-hosted open-source alternative to the OpenAI text completion API
9 projects | /r/LocalLLaMA | 1 Jun 2023

sounds like you’re asking for exactly this? https://github.com/keldenl/gpt-llama.cpp
LLaMA and AutoAPI?
1 project | /r/LocalLLaMA | 17 May 2023
New big update to GPTNicheFinder: better trends analysis and scoring system, cleaned up UI and verbose in the terminal for people who want to see what is going on and to verify the results
2 projects | /r/GPT3 | 16 May 2023

I salut you good sir. This is an amazing idea. I don't have time but it will be interesting idea to use this wrapper https://github.com/keldenl/gpt-llama.cpp which simulates GPT endpoint for local lama, so basically we can have amazing tool for completely free use. If somebody test it please let me know underneath my comment!
I build an AI powered writing tools, an AI co-author
1 project | /r/singularity | 29 Apr 2023

I would gladly buy your product to run with a local model, like Vicuna ggml , also see https://github.com/keldenl/gpt-llama.cpp/
Serge... Just works
3 projects | /r/LocalLLaMA | 28 Apr 2023

possible through fastllama in python or gpt-llama.cpp an API wrapper around llama.cpp
Embeddings?
3 projects | /r/LocalLLaMA | 24 Apr 2023

https://github.com/keldenl/gpt-llama.cpp supports embeddings, and it even takes in openai type requests and returns openai compatible responses!
I built a completely Local AutoGPT with the help of GPT-llama running Vicuna-13B
1 project | news.ycombinator.com | 24 Apr 2023

https://github.com/keldenl/gpt-llama.cpp
I build a completely Local and portable AutoGPT with the help of gpt-llama, running on Vicuna-13b
4 projects | /r/LocalLLaMA | 24 Apr 2023
Adding Long-Term Memory to Custom LLMs: Let's Tame Vicuna Together!
7 projects | /r/LocalLLaMA | 21 Apr 2023

There's a (kind of) working Auto-GPT solution that uses Vicuna https://github.com/keldenl/gpt-llama.cpp/blob/master/docs/Auto-GPT-setup-guide.md

llama.cpp

Posts with mentions or reviews of llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-07.

IBM Granite: A Family of Open Foundation Models for Code Intelligence
3 projects | news.ycombinator.com | 7 May 2024

if you can compile stuff, then looking at llama.cpp (what ollama uses) is also interesting: https://github.com/ggerganov/llama.cpp
the server is here: https://github.com/ggerganov/llama.cpp/tree/master/examples/...
And you can search for any GGUF on huggingface
Ask HN: Affordable hardware for running local large language models?
1 project | news.ycombinator.com | 5 May 2024

Yes, Metal seems to allow a maximum of 1/2 of the RAM for one process, and 3/4 of the RAM allocated to the GPU overall. There’s a kernel hack to fix it, but that comes with the usual system integrity caveats. https://github.com/ggerganov/llama.cpp/discussions/2182
Xmake: A modern C/C++ build tool
7 projects | news.ycombinator.com | 4 May 2024
Better and Faster Large Language Models via Multi-Token Prediction
1 project | news.ycombinator.com | 1 May 2024

For anyone interested in exploring this, llama.cpp has an example implementation here:
https://github.com/ggerganov/llama.cpp/tree/master/examples/...
Llama.cpp Bfloat16 Support
1 project | news.ycombinator.com | 30 Apr 2024
Fine-tune your first large language model (LLM) with LoRA, llama.cpp, and KitOps in 5 easy steps
1 project | dev.to | 30 Apr 2024

Getting started with LLMs can be intimidating. In this tutorial we will show you how to fine-tune a large language model using LoRA, facilitated by tools like llama.cpp and KitOps.
GGML Flash Attention support merged into llama.cpp
1 project | news.ycombinator.com | 30 Apr 2024
Phi-3 Weights Released
1 project | news.ycombinator.com | 23 Apr 2024

well https://github.com/ggerganov/llama.cpp/issues/6849
Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding
3 projects | news.ycombinator.com | 21 Apr 2024
Llama.cpp Working on Support for Llama3
1 project | news.ycombinator.com | 18 Apr 2024

What are some alternatives?

When comparing gpt-llama.cpp and llama.cpp you can also consider the following projects:

llama_index - LlamaIndex is a data framework for your LLM applications

ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models.

Auto-LLM-Local - Created my own python script similar to AutoGPT where you supply a local llm model like alpaca13b (The main one I use), and the script can access the supplied tools to achieve your objective. Code fully works as far as I can tell. Takes me 5 minutes per chain on my slow laptop.

gpt4all - gpt4all: run open-source LLMs anywhere

long_term_memory - A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]

GPTQ-for-LLaMa - 4 bits quantization of LLaMA using GPTQ

semantic-kernel - Integrate cutting-edge LLM technology quickly and easily into your apps

ggml - Tensor library for machine learning

langchain - 🦜🔗 Build context-aware reasoning applications

alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM