gpt-llama.cpp vs basaran

gpt-llama.cpp

A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI. (by keldenl)

basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models. (by hyperonym)

generative-model Gpt huggingface language-model Natural Language Processing openai-api streaming-api text-generation chatgpt

DISCONTINUED

gpt-llama.cpp	Project	basaran
12	Mentions	22
587	Stars	1,281
-	Growth	-
8.2	Activity	10.0
11 months ago	Latest Commit	4 months ago
JavaScript	Language	Python
MIT License	License	MIT License

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity - a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.

gpt-llama.cpp

Posts with mentions or reviews of gpt-llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-22.

Attempt to run Llama on a remote server with chatbot-ui
2 projects | /r/LocalLLaMA | 22 Jun 2023

Hi! I really like the solution https://github.com/keldenl/gpt-llama.cpp which helps to deploy https://github.com/mckaywrigley/chatbot-ui on a local model. I am running this together with Wizard7b or 13b locally and it works fine, but when I tried to upload it to a remote server I ran into an error.
Introducing Basaran: self-hosted open-source alternative to the OpenAI text completion API
9 projects | /r/LocalLLaMA | 1 Jun 2023

sounds like you’re asking for exactly this? https://github.com/keldenl/gpt-llama.cpp
LLaMA and AutoAPI?
1 project | /r/LocalLLaMA | 17 May 2023
New big update to GPTNicheFinder: better trends analysis and scoring system, cleaned up UI and verbose in the terminal for people who want to see what is going on and to verify the results
2 projects | /r/GPT3 | 16 May 2023

I salute you, good sir. This is an amazing idea. I don't have time, but it would be interesting to use this wrapper https://github.com/keldenl/gpt-llama.cpp which simulates a GPT endpoint for a local llama model, so basically we could have an amazing tool that is completely free to use. If somebody tests it, please let me know under my comment!
I build an AI powered writing tools, an AI co-author
1 project | /r/singularity | 29 Apr 2023

I would gladly buy your product to run with a local model, like Vicuna ggml. Also see https://github.com/keldenl/gpt-llama.cpp/
Serge... Just works
3 projects | /r/LocalLLaMA | 28 Apr 2023

Possible through fastllama in Python, or through gpt-llama.cpp, an API wrapper around llama.cpp.
Embeddings?
3 projects | /r/LocalLLaMA | 24 Apr 2023

https://github.com/keldenl/gpt-llama.cpp supports embeddings, and it even takes in OpenAI-style requests and returns OpenAI-compatible responses!
I built a completely Local AutoGPT with the help of GPT-llama running Vicuna-13B
1 project | news.ycombinator.com | 24 Apr 2023

https://github.com/keldenl/gpt-llama.cpp
I build a completely Local and portable AutoGPT with the help of gpt-llama, running on Vicuna-13b
4 projects | /r/LocalLLaMA | 24 Apr 2023
Adding Long-Term Memory to Custom LLMs: Let's Tame Vicuna Together!
7 projects | /r/LocalLLaMA | 21 Apr 2023

There's a (kind of) working Auto-GPT solution that uses Vicuna https://github.com/keldenl/gpt-llama.cpp/blob/master/docs/Auto-GPT-setup-guide.md

basaran

Posts with mentions or reviews of basaran. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-19.

OpenLLM
10 projects | news.ycombinator.com | 19 Jun 2023
Langchain and self hosted LLaMA hosted API
6 projects | /r/LocalLLaMA | 15 Jun 2023

What are the current best "no reinventing the wheel" approaches to have Langchain use an LLM through a locally hosted REST API, the likes of Oobabooga or hyperonym/basaran with streaming support for 4-bit GPTQ?
Run and create custom ChatGPT-like bots with OpenChat
15 projects | news.ycombinator.com | 7 Jun 2023

Disclaimer: I am curating LLM tools on GitHub [1]
A few thoughts:
* Allow for custom endpoint URLs; this way people can use open-source LLMs with a fake OpenAI API backend like basaran [2] or llama-api-server [3]
* Look into better embedding methods for information retrieval, like InstructorEmbeddings or Document Summary Index
* Don't use a single embedding per content item; use multiple to increase retrieval quality
[1] https://github.com/underlines/awesome-marketing-datascience/...
[2] https://github.com/hyperonym/basaran
[3] https://github.com/iaalm/llama-api-server
1-Jun-2023
2 projects | /r/dailyainews | 2 Jun 2023

open-source alternative to the OpenAI text completion API (https://github.com/hyperonym/basaran)
Introducing Basaran: self-hosted open-source alternative to the OpenAI text completion API
9 projects | /r/LocalLLaMA | 1 Jun 2023
Basaran is an open-source alternative to the OpenAI text completion API
1 project | news.ycombinator.com | 31 May 2023
Ask HN: What's the best self hosted/local alternative to GPT-4?
12 projects | news.ycombinator.com | 31 May 2023

Guanaco-65B[0] using Basaran[1] for your OpenAI compatible API. You can use any ChatGPT front-end which lets you change the OpenAI endpoint URL.
[0] An fp4 finetune of LLaMA-30B by Tim Dettmers
[1] https://github.com/hyperonym/basaran
Are all the finetunes stupid?
5 projects | /r/LocalLLaMA | 22 Apr 2023

For lm-eval, I think you'd either need to take GPTQ's inference script and shim it into a model: https://github.com/EleutherAI/lm-evaluation-harness/tree/master/lm_eval/models or you might be able to use a project like https://github.com/hyperonym/basaran and then you could use the gpt3 model...
Using the API in Node
3 projects | /r/Oobabooga | 11 Apr 2023

There are also:
* Basaran repo: "Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models". "...Compatibility with OpenAI API and client libraries..."
* llama-cpp-python repo: "Simple Python bindings for @ggerganov's llama.cpp library...". "...OpenAI-like API...".
Researcher looking for help with how to prepare a finetuning dataset for models like Bloomz and Cerebras-GPT
2 projects | /r/ArtificialInteligence | 2 Apr 2023

I want to start with a totally freely available model, so again, that excludes things like LLaMA where the weights are only available through a wait list. The two models that most get my attention and (I think, and hope) fit my criteria of open availability are Cerebras-GPT (13b) and Bloomz (7b). The tools to process and fine-tune that seem most feasible to me, from my limited knowledge, are xturing and basaran.
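
Several of the posts above describe the same pattern: an app written against the OpenAI API is pointed at a self-hosted server by changing only the endpoint URL. The Python sketch below illustrates that idea under the assumption that the local server accepts requests on the OpenAI text completion path /v1/completions. The base URL, port, model name, API key value, and the helper names build_completion_request and send are placeholders for illustration and are not taken from either project's documentation.

```python
import json
import urllib.request


def build_completion_request(base_url, prompt, model, max_tokens=64, api_key="not-needed"):
    """Build (but do not send) an OpenAI-style text completion request.

    base_url is the only thing that changes between a hosted API and a
    local server; everything else keeps the same shape.
    """
    # Assumption: the server mirrors the OpenAI path for text completions.
    url = base_url.rstrip("/") + "/v1/completions"
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Local servers may ignore this header; it is sent here only
            # so the request has the same shape as a hosted-API request.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Hypothetical local endpoint and model name; adjust to your own setup.
request = build_completion_request("http://localhost:8000", "Hello", "local-model")
```

Calling send(request) would perform the actual HTTP call, so it only works while a compatible server is listening at the chosen base URL. Keeping the request construction separate from the network call makes it easy to switch endpoints and to inspect the payload without a running server.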

What are some alternatives?

When comparing gpt-llama.cpp and basaran you can also consider the following projects:

llama_index - LlamaIndex is a data framework for your LLM applications

text-generation-inference - Large Language Model Text Generation Inference

Auto-LLM-Local - Created my own python script similar to AutoGPT where you supply a local llm model like alpaca13b (The main one I use), and the script can access the supplied tools to achieve your objective. Code fully works as far as I can tell. Takes me 5 minutes per chain on my slow laptop.

openai-chatgpt-opentranslator - Python command that uses openai to perform text translations

long_term_memory - A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.

AutoGPTQ - An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]

NeMo-Guardrails - NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

semantic-kernel - Integrate cutting-edge LLM technology quickly and easily into your apps

llm-foundry - LLM training code for Databricks foundation models

langchain - 🦜🔗 Build context-aware reasoning applications

alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM
