gpt-llama.cpp vs serge

gpt-llama.cpp

A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI. (by keldenl)

Suggest topics

Source Code

Suggest alternative

Edit details

serge

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API. (by serge-chat)

llama alpaca Docker Fastapi llamacpp Python Web Svelte sveltekit Tailwindcss Nginx

Source Code

serge.chat

Suggest alternative

Edit details

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App

With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

surveyjs.io

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

gpt-llama.cpp		serge
	Project
12	Mentions	40
587	Stars	5,553
-	Growth	0.9%
8.2	Activity	9.8
11 months ago	Latest Commit	3 days ago
JavaScript	Language	Svelte
MIT License	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

gpt-llama.cpp

Posts with mentions or reviews of gpt-llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-22.

Attempt to run Llama on a remote server with chatbot-ui
2 projects | /r/LocalLLaMA | 22 Jun 2023

hi! I really like the solution https://github.com/keldenl/gpt-llama.cpp which helps to deploy https://github.com/mckaywrigley/chatbot-ui on the local model. I am running this together with Wizard7b or 13b locally and it works fine, but when I tried to upload to a remote server I met an error.
Introducing Basaran: self-hosted open-source alternative to the OpenAI text completion API
9 projects | /r/LocalLLaMA | 1 Jun 2023

sounds like you’re asking for exactly this? https://github.com/keldenl/gpt-llama.cpp
LLaMA and AutoAPI?
1 project | /r/LocalLLaMA | 17 May 2023
New big update to GPTNicheFinder: better trends analysis and scoring system, cleaned up UI and verbose in the terminal for people who want to see what is going on and to verify the results
2 projects | /r/GPT3 | 16 May 2023

I salut you good sir. This is an amazing idea. I don't have time but it will be interesting idea to use this wrapper https://github.com/keldenl/gpt-llama.cpp which simulates GPT endpoint for local lama, so basically we can have amazing tool for completely free use. If somebody test it please let me know underneath my comment!
I build an AI powered writing tools, an AI co-author
1 project | /r/singularity | 29 Apr 2023

I would gladly buy your product to run with a local model, like Vicuna ggml , also see https://github.com/keldenl/gpt-llama.cpp/
Serge... Just works
3 projects | /r/LocalLLaMA | 28 Apr 2023

possible through fastllama in python or gpt-llama.cpp an API wrapper around llama.cpp
Embeddings?
3 projects | /r/LocalLLaMA | 24 Apr 2023

https://github.com/keldenl/gpt-llama.cpp supports embeddings, and it even takes in openai type requests and returns openai compatible responses!
I built a completely Local AutoGPT with the help of GPT-llama running Vicuna-13B
1 project | news.ycombinator.com | 24 Apr 2023

https://github.com/keldenl/gpt-llama.cpp
I build a completely Local and portable AutoGPT with the help of gpt-llama, running on Vicuna-13b
4 projects | /r/LocalLLaMA | 24 Apr 2023
Adding Long-Term Memory to Custom LLMs: Let's Tame Vicuna Together!
7 projects | /r/LocalLLaMA | 21 Apr 2023

There's a (kind of) working Auto-GPT solution that uses Vicuna https://github.com/keldenl/gpt-llama.cpp/blob/master/docs/Auto-GPT-setup-guide.md

serge

Posts with mentions or reviews of serge. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-27.

Show HN: I made an app to use local AI as daily driver
31 projects | news.ycombinator.com | 27 Feb 2024
chatgpt alternative
3 projects | /r/selfhosted | 8 Dec 2023
Show HN: LlamaGPT – Self-hosted, offline, private AI chatbot, powered by Llama 2
12 projects | news.ycombinator.com | 16 Aug 2023

Very cool, this looks like a combination of chatbot-ui and llama-cpp-python? A similar project I've been using is https://github.com/serge-chat/serge. Nous-Hermes-Llama2-13b is my daily driver and scores high on coding evaluations (https://huggingface.co/spaces/mike-ravkine/can-ai-code-resul...).
LeCun: Qualcomm working with Meta to run Llama-2 on mobile devices
4 projects | news.ycombinator.com | 23 Jul 2023

You might be pleased to hear that nothing really stops you from doing this today. If you ran Serge[0] on a Mac with Tailscale, you could hack together a decently-accelerated Llama chatbot.
[0] https://github.com/serge-chat/serge
Chatbot frontend library in Svelte?
2 projects | /r/sveltejs | 28 Jun 2023

Cannot help you with libraries specifically but both Serge and ChatUI are built using SvelteKit, so the code might be of some use to you.
We’re back and…
5 projects | /r/selfhosted | 18 Jun 2023
Best way to use AMD CPU and GPU
5 projects | /r/LocalLLaMA | 17 Jun 2023

Serge made it really easy for me to get started, but it all CPU-based.
Need Help
2 projects | /r/LangChain | 15 Jun 2023

All that said this project probably solves your problem: https://github.com/serge-chat/serge
Are you selfhosting a ChatGPT alternative?
8 projects | /r/selfhosted | 9 Jun 2023
What the hell??
1 project | /r/Weird | 3 Jun 2023

You can play a little bit with more straightforward local models (the simplest to setup is https://github.com/nsarrazin/serge ), to see that any LLM is basically a party trick.

What are some alternatives?

When comparing gpt-llama.cpp and serge you can also consider the following projects:

llama_index - LlamaIndex is a data framework for your LLM applications

gpt4all - gpt4all: run open-source LLMs anywhere

Auto-LLM-Local - Created my own python script similar to AutoGPT where you supply a local llm model like alpaca13b (The main one I use), and the script can access the supplied tools to achieve your objective. Code fully works as far as I can tell. Takes me 5 minutes per chain on my slow laptop.

langflow - ⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.

long_term_memory - A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.

llama.cpp - LLM inference in C/C++

langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

semantic-kernel - Integrate cutting-edge LLM technology quickly and easily into your apps

FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

langchain - 🦜🔗 Build context-aware reasoning applications

llama-gpt - A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!