gpt-llama.cpp vs fastLLaMa

gpt-llama.cpp

A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI. (by keldenl)

Suggest topics

Source Code

Suggest alternative

Edit details

fastLLaMa

fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backend. (by PotatoSpudowski)

lama Python lamacpp C CPP

Source Code

potatospudowski.github.io

Suggest alternative

Edit details

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App

With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

surveyjs.io

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

gpt-llama.cpp		fastLLaMa
	Project
12	Mentions	6
587	Stars	403
-	Growth	-
8.2	Activity	7.1
11 months ago	Latest Commit	11 months ago
JavaScript	Language	C
MIT License	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

gpt-llama.cpp

Posts with mentions or reviews of gpt-llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-22.

Attempt to run Llama on a remote server with chatbot-ui
2 projects | /r/LocalLLaMA | 22 Jun 2023

hi! I really like the solution https://github.com/keldenl/gpt-llama.cpp which helps to deploy https://github.com/mckaywrigley/chatbot-ui on the local model. I am running this together with Wizard7b or 13b locally and it works fine, but when I tried to upload to a remote server I met an error.
Introducing Basaran: self-hosted open-source alternative to the OpenAI text completion API
9 projects | /r/LocalLLaMA | 1 Jun 2023

sounds like you’re asking for exactly this? https://github.com/keldenl/gpt-llama.cpp
LLaMA and AutoAPI?
1 project | /r/LocalLLaMA | 17 May 2023
New big update to GPTNicheFinder: better trends analysis and scoring system, cleaned up UI and verbose in the terminal for people who want to see what is going on and to verify the results
2 projects | /r/GPT3 | 16 May 2023

I salut you good sir. This is an amazing idea. I don't have time but it will be interesting idea to use this wrapper https://github.com/keldenl/gpt-llama.cpp which simulates GPT endpoint for local lama, so basically we can have amazing tool for completely free use. If somebody test it please let me know underneath my comment!
I build an AI powered writing tools, an AI co-author
1 project | /r/singularity | 29 Apr 2023

I would gladly buy your product to run with a local model, like Vicuna ggml , also see https://github.com/keldenl/gpt-llama.cpp/
Serge... Just works
3 projects | /r/LocalLLaMA | 28 Apr 2023

possible through fastllama in python or gpt-llama.cpp an API wrapper around llama.cpp
Embeddings?
3 projects | /r/LocalLLaMA | 24 Apr 2023

https://github.com/keldenl/gpt-llama.cpp supports embeddings, and it even takes in openai type requests and returns openai compatible responses!
I built a completely Local AutoGPT with the help of GPT-llama running Vicuna-13B
1 project | news.ycombinator.com | 24 Apr 2023

https://github.com/keldenl/gpt-llama.cpp
I build a completely Local and portable AutoGPT with the help of gpt-llama, running on Vicuna-13b
4 projects | /r/LocalLLaMA | 24 Apr 2023
Adding Long-Term Memory to Custom LLMs: Let's Tame Vicuna Together!
7 projects | /r/LocalLLaMA | 21 Apr 2023

There's a (kind of) working Auto-GPT solution that uses Vicuna https://github.com/keldenl/gpt-llama.cpp/blob/master/docs/Auto-GPT-setup-guide.md

fastLLaMa

Posts with mentions or reviews of fastLLaMa. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-03.

[N] OpenLLaMA: An Open Reproduction of LLaMA
5 projects | /r/MachineLearning | 3 May 2023

If your GPU isn't good enough, you could use llama.cpp, which runs on CPU, or one of its forks like fastLLaMa.
Serge... Just works
3 projects | /r/LocalLLaMA | 28 Apr 2023

possible through fastllama in python or gpt-llama.cpp an API wrapper around llama.cpp
llama-cpp-python VS fastLLaMa - a user suggested alternative
2 projects | 25 Apr 2023

It is better, Lots of low level cpp optimisations that are better
[P] LoRA adapter switching at runtime to enable Base model to inherit multiple personalities
2 projects | /r/MachineLearning | 20 Apr 2023

u/_Arsenie_Boca_ you can have a look at this discussion for more info https://github.com/PotatoSpudowski/fastLLaMa/discussions/48
[P] fastLLaMa, A python wrapper to run llama.cpp
5 projects | /r/MachineLearning | 21 Mar 2023

Repo Link

What are some alternatives?

When comparing gpt-llama.cpp and fastLLaMa you can also consider the following projects:

llama_index - LlamaIndex is a data framework for your LLM applications

llama-cpp-python - Python bindings for llama.cpp

Auto-LLM-Local - Created my own python script similar to AutoGPT where you supply a local llm model like alpaca13b (The main one I use), and the script can access the supplied tools to achieve your objective. Code fully works as far as I can tell. Takes me 5 minutes per chain on my slow laptop.

llama - Inference code for Llama models

long_term_memory - A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.

llama.cpp - LLM inference in C/C++

langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]

llama.py - Python bindings to llama.cpp

semantic-kernel - Integrate cutting-edge LLM technology quickly and easily into your apps

serge - A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

langchain - 🦜🔗 Build context-aware reasoning applications

AGiXT - AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.