gpt-llama.cpp
learn-langchain
Metric | gpt-llama.cpp | learn-langchain |
---|---|---|
Mentions | 12 | 8 |
Stars | 587 | 274 |
Growth | - | - |
Activity | 8.2 | 6.7 |
Last commit | 11 months ago | 12 months ago |
Language | JavaScript | Python |
License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gpt-llama.cpp
- Attempt to run Llama on a remote server with chatbot-ui
  Hi! I really like the solution https://github.com/keldenl/gpt-llama.cpp, which helps to deploy https://github.com/mckaywrigley/chatbot-ui on a local model. I am running this together with Wizard7b or 13b locally and it works fine, but when I tried to upload it to a remote server I ran into an error.
- Introducing Basaran: self-hosted open-source alternative to the OpenAI text completion API
  Sounds like you’re asking for exactly this? https://github.com/keldenl/gpt-llama.cpp
- LLaMA and AutoAPI?
- New big update to GPTNicheFinder: better trends analysis and scoring system, cleaned up UI and verbose in the terminal for people who want to see what is going on and to verify the results
  I salute you, good sir. This is an amazing idea. I don't have time, but it would be interesting to use this wrapper https://github.com/keldenl/gpt-llama.cpp, which simulates a GPT endpoint for a local llama, so basically we could have an amazing tool that is completely free to use. If somebody tests it, please let me know in a reply to my comment!
- I build an AI powered writing tools, an AI co-author
  I would gladly buy your product to run with a local model, like Vicuna ggml; also see https://github.com/keldenl/gpt-llama.cpp/
- Serge... Just works
  Possible through fastllama in Python, or gpt-llama.cpp, an API wrapper around llama.cpp
- Embeddings?
  https://github.com/keldenl/gpt-llama.cpp supports embeddings, and it even takes in OpenAI-type requests and returns OpenAI-compatible responses!
- I built a completely Local AutoGPT with the help of GPT-llama running Vicuna-13B
  https://github.com/keldenl/gpt-llama.cpp
- I build a completely Local and portable AutoGPT with the help of gpt-llama, running on Vicuna-13b
- Adding Long-Term Memory to Custom LLMs: Let's Tame Vicuna Together!
  There's a (kind of) working Auto-GPT solution that uses Vicuna: https://github.com/keldenl/gpt-llama.cpp/blob/master/docs/Auto-GPT-setup-guide.md
learn-langchain
- Alternative to LangChain for open LLMs?
- Can someone explain why there isn't a good interface for the oobabooga api in langchain?
- Vicuna/LLaMMA Models and Langchain Tools
- How to run .safetensors models with langchain/huggingface pipelines?
- Local Vicuna: Building a Q/A bot over a text file with langchain, Vicuna and Sentence Transformers
- Embeddings?
  Source code: https://github.com/paolorechia/learn-langchain/tree/main/langchain_app/document
- Is it possible to run GPTQ quantized 4bit 13B Vicuna locally on a GPU with langchain?
  If not and you need to stream and cut off the text more manually, you may want to take a look at this implementation of Vicuna under LangChain: https://github.com/paolorechia/learn-langchain/
- Creating an AI Agent with Vicuna 7B and Langchain: fetching a random Chuck Norris joke
  You can find my code here: https://github.com/paolorechia/learn-langchain
What are some alternatives?
llama_index - LlamaIndex is a data framework for your LLM applications
AgentOoba - An autonomous AI agent extension for Oobabooga's web ui
Auto-LLM-Local - Created my own python script similar to AutoGPT where you supply a local llm model like alpaca13b (The main one I use), and the script can access the supplied tools to achieve your objective. Code fully works as far as I can tell. Takes me 5 minutes per chain on my slow laptop.
gptq_for_langchain - A guide about how to use GPTQ models with langchain
long_term_memory - A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]
vicuna-react-lora - An experiment of finetuning Vicuna with ReAct instructions
semantic-kernel - Integrate cutting-edge LLM technology quickly and easily into your apps
GPTQ-for-LLaMa-API - Provide a way to use the GPT-QLLama model as an API
langchain - 🦜🔗 Build context-aware reasoning applications
BrainChulo - Harnessing the Memory Power of the Camelids