localGPT vs llama.cpp

localGPT

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private. (by PromtEngineer)

llama.cpp

LLM inference in C/C++ (by ggerganov)

Topics: llama, llm

Project	localGPT	llama.cpp
Mentions	29	774
Stars	19,193	57,463
Growth	-	-
Activity	8.6	10.0
Latest Commit	2 days ago	1 day ago
Language	Python	C++
License	Apache License 2.0	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
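The page does not publish the formula behind the Activity number, so the following is only a minimal sketch of one way such a score could work. It assumes an exponential decay so that recent commits count more than older ones, and a percentile rank scaled to 0-10 so that 9.0 corresponds to the top 10% of tracked projects. The function names and the 30-day half-life are hypothetical, chosen for illustration.

```python
from bisect import bisect_left
from datetime import date


def weighted_activity(commit_dates, today, half_life_days=30.0):
    """Sum per-commit weights; a commit's weight halves every half_life_days.

    The decay shape and the half-life are assumptions, not the site's formula.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def relative_score(raw, all_raw):
    """Scale the share of tracked projects with strictly lower raw activity to 0-10."""
    ranked = sorted(all_raw)
    below = bisect_left(ranked, raw)  # count of projects with a lower raw value
    return round(10.0 * below / len(ranked), 1)


if __name__ == "__main__":
    today = date(2024, 5, 4)
    fresh = weighted_activity([date(2024, 5, 3), date(2024, 5, 1)], today)
    stale = weighted_activity([date(2023, 9, 1), date(2023, 8, 1)], today)
    print(fresh > stale)  # two recent commits outweigh two old ones
```

Under these assumptions a project scores 9.0 exactly when 90% of tracked projects have a lower raw activity, which matches the "top 10%" reading given above.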

localGPT

Posts with mentions or reviews of localGPT. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-15.

Show HN: IncarnaMind-Chat with your multiple docs using LLMs
4 projects | news.ycombinator.com | 15 Sep 2023

I think local LLMs are great for tinkerers, and with quantization can run on most modern PCs. I am not comfortable sending over my personal data over to OpenAI/Anthropic, so I've been playing around with https://github.com/PromtEngineer/localGPT/, GPT4All, etc. which keep the data all local.
Sliding window chunking, RAG, etc. seem more sophisticated than the other document LLM tools, so I would love to try this out if you ever add the ability to run LLMs locally!

FLaNK Stack Weekly for 21 August 2023
18 projects | dev.to | 21 Aug 2023

PromtEngineer/localGPT: Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
1 project | /r/devopsish | 26 Jul 2023

Ask HN: How do I train a custom LLM/ChatGPT on my own documents?
8 projects | news.ycombinator.com | 23 Jul 2023

localGPT can parse PDF into embeddings, see <https://github.com/PromtEngineer/localGPT>.

Which platform or model to use for fine tuning pdf files ?
1 project | /r/LanguageTechnology | 10 Jul 2023

This is going so fast that it feels like a new thing pops up every day. LocalGPT seems to have gotten a lot of traction though: https://github.com/PromtEngineer/localGPT

Any successful guides on scanning internal pages and build a virtual assistant using LLAMA?
1 project | /r/LocalLLaMA | 9 Jul 2023

CUDA Out of memory with Nvidia A2 need help
1 project | /r/pytorch | 4 Jul 2023

i am currently trying to use localGPT (https://github.com/PromtEngineer/localGPT) for a project and i encountered a problem.

Using Local LLMs for things besides chat?
3 projects | /r/LocalLLaMA | 30 Jun 2023

I tinker a lot with electronics. I have put datasheets for components, documentation for development boards, documentation for software libraries, etc into a database with localGPT.

Question regarding model compatibility for Alpaca Turbo
8 projects | /r/LocalLLaMA | 30 Jun 2023

There are a bunch of other methods to improve quality and performance like tree-of-thought-llm, connecting a LLM to a database or have it review its own output.

Tools for ingesting .pdf files locally for training/fine-tuning?
1 project | /r/ArtificialInteligence | 29 Jun 2023

Check out local gpt on git hub. I tried but it had slow response for me. Other developers are fine. https://github.com/PromtEngineer/localGPT

llama.cpp

Posts with mentions or reviews of llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-04.

Xmake: A modern C/C++ build tool
5 projects | news.ycombinator.com | 4 May 2024

Better and Faster Large Language Models via Multi-Token Prediction
1 project | news.ycombinator.com | 1 May 2024

For anyone interested in exploring this, llama.cpp has an example implementation here:
https://github.com/ggerganov/llama.cpp/tree/master/examples/...

Llama.cpp Bfloat16 Support
1 project | news.ycombinator.com | 30 Apr 2024

Fine-tune your first large language model (LLM) with LoRA, llama.cpp, and KitOps in 5 easy steps
1 project | dev.to | 30 Apr 2024

Getting started with LLMs can be intimidating. In this tutorial we will show you how to fine-tune a large language model using LoRA, facilitated by tools like llama.cpp and KitOps.

GGML Flash Attention support merged into llama.cpp
1 project | news.ycombinator.com | 30 Apr 2024

Phi-3 Weights Released
1 project | news.ycombinator.com | 23 Apr 2024

well https://github.com/ggerganov/llama.cpp/issues/6849

Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding
3 projects | news.ycombinator.com | 21 Apr 2024

Llama.cpp Working on Support for Llama3
1 project | news.ycombinator.com | 18 Apr 2024

Embeddings are a good starting point for the AI curious app developer
7 projects | news.ycombinator.com | 17 Apr 2024

Have just done this recently for local chat with pdf feature in https://recurse.chat. (It's a macOS app that has built-in llama.cpp server and local vector database)
Running an embedding server locally is pretty straightforward:
- Get llama.cpp release binary: https://github.com/ggerganov/llama.cpp/releases

Mixtral 8x22B
4 projects | news.ycombinator.com | 17 Apr 2024
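The 17 Apr 2024 post above mentions running an embedding server locally with llama.cpp but its steps are cut off after the first one. As a rough illustration of the client side only, here is a minimal sketch that builds a request for a locally running server and parses the reply. The host, port, `/v1/embeddings` path, and the OpenAI-style response shape are assumptions, not details from the post; check the llama.cpp server documentation for the version you run. The helper names are hypothetical.

```python
import json
import urllib.request

# Assumed values, not taken from the post: where the local server listens.
BASE_URL = "http://localhost:8080"


def build_embedding_request(text, base_url=BASE_URL):
    """Build (but do not send) a POST request asking the server to embed `text`."""
    body = json.dumps({"input": text}).encode("utf-8")
    return urllib.request.Request(
        base_url + "/v1/embeddings",  # assumed OpenAI-style endpoint path
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_embedding(response_body):
    """Extract the first vector from an assumed OpenAI-style response body."""
    payload = json.loads(response_body)
    return payload["data"][0]["embedding"]


def embed(text, base_url=BASE_URL):
    """Send the request; this needs a server actually running at base_url."""
    with urllib.request.urlopen(build_embedding_request(text, base_url)) as resp:
        return parse_embedding(resp.read())
```

Splitting request building from sending keeps the sketch checkable without a server; only `embed` touches the network.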

What are some alternatives?

When comparing localGPT and llama.cpp you can also consider the following projects:

private-gpt - Interact with your documents using the power of GPT, 100% privately, no data leaks

ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models.

privateGPT - Interact with your documents using the power of GPT, 100% privately, no data leaks [Moved to: https://github.com/zylon-ai/private-gpt]

gpt4all - gpt4all: run open-source LLMs anywhere

LocalAI - 🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. It can generate text, audio, video, and images, and also offers voice cloning.

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

gpt4-pdf-chatbot-langchain - GPT4 & LangChain Chatbot for large PDF docs

GPTQ-for-LLaMa - 4 bits quantization of LLaMA using GPTQ

llama_index - LlamaIndex is a data framework for your LLM applications

ggml - Tensor library for machine learning

quivr - Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.

alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM