Top 10 C++ llama Projects

llama.cpp

769 56,891 10.0 C++

LLM inference in C/C++

Project mention: Phi-3 Weights Released | news.ycombinator.com | 2024-04-23

well https://github.com/ggerganov/llama.cpp/issues/6849

LocalAI

82 19,593 9.9 C++

:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

Project mention: Drop-In Replacement for ChatGPT API | news.ycombinator.com | 2024-01-24

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
PowerInfer

4 6,942 9.8 C++

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Project mention: FLaNK 25 December 2023 | dev.to | 2023-12-26

nitro

7 1,572 9.8 C++

An inference server on top of llama.cpp. OpenAI-compatible API, queue, & scaling. Embed a prod-ready, local inference engine in your apps. Powers Jan (by janhq)

Project mention: Ollama Python and JavaScript Libraries | news.ycombinator.com | 2024-01-24

I'd like to see a comparison to nitro https://github.com/janhq/nitro which has been fantastic for running a local LLM.

LlamaGPTJ-chat

3 203 8.7 C++

Simple chat program for LLaMa, GPT-J, and MPT models.

Project mention: New to this community, most models I download fail and end up in a core dump | /r/LocalLLaMA | 2023-05-23

If you want to use that model specifically, check out : https://github.com/kuvaus/LlamaGPTJ-chat

llama_cpp.rb

2 129 9.6 C++

llama_cpp provides Ruby bindings for llama.cpp

Project mention: Llama.cpp: Full CUDA GPU Acceleration | news.ycombinator.com | 2023-06-12

Python sits on the C-glue segment of programming languages (where Perl, PHP, Ruby and Node are also notable members). Being a glue language means having APIs to a lot of external toolchains written in not only C/C++ but many other compiled languages, APIs and system resources. Conda, virtualenv, etc. are godsend modules for making it all work, or even better, to freeze things once they all work, without resourcing to Docker, VMs or shell scripts. It's meant for application and DevOps people who need to slap together, ie, ML, Numpy, Elasticsearch, AWS APIs and REST endpoints and Get $hit Done.
It's annoying to see them "glueys" compared to the binary compiled segment where the heavy lifting is done. Python and others exist to latch on and assimilate. Resistance is futile:
https://pypi.org/project/pyllamacpp/
https://www.npmjs.com/package/llama-node
https://packagist.org/packages/kambo/llama-cpp-php
https://github.com/yoshoku/llama_cpp.rb

collider

1 117 9.4 C++

Large Model Collider - The Platform for serving LLM models

Project mention: Show HN: Collider – the platform for local LLM debug and inference at warp speed | news.ycombinator.com | 2023-11-30

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
pyllamacpp

4 59 7.0 C++

Python bindings for llama.cpp

Project mention: How to - Local LLMs ? | /r/artificial | 2023-05-10

For me, the easiest way was to start was to use https://github.com/abdeladim-s/pyllamacpp. If you have Python installed, do `pip install numpy pyllamacpp`, download compatible model file (links are in README.md) and simply run with `pyllamacpp model.file`.

llama-server-chat-terminal-client

1 10 6.8 C++

Lightweight chat terminal-interface for llama.cpp server compilable for windows and linux.

Project mention: Terminal client chat for llama.cpp server. | /r/LocalLLaMA | 2023-12-05

llama-chat

1 7 7.0 C++

Simple chat program for LLaMa models (by kuvaus)

Project mention: Local vicuna AI for low end pc? | /r/LocalLLaMA | 2023-06-21

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

C++ llama related posts

Phi-3 Weights Released
1 project | news.ycombinator.com | 23 Apr 2024
Llama.cpp Working on Support for Llama3
1 project | news.ycombinator.com | 18 Apr 2024
Mixtral 8x22B
4 projects | news.ycombinator.com | 17 Apr 2024
Llama.cpp: Improve CPU prompt eval speed
1 project | news.ycombinator.com | 17 Apr 2024
Ollama 0.1.32: WizardLM 2, Mixtral 8x22B, macOS CPU/GPU model split
9 projects | news.ycombinator.com | 17 Apr 2024
KodiBot - Local Chatbot App for Desktop
2 projects | dev.to | 11 Apr 2024
More Agents Is All You Need: LLMs performance scales with the number of agents
2 projects | news.ycombinator.com | 6 Apr 2024
A note from our sponsor - SaaSHub
www.saashub.com | 28 Apr 2024

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source llama projects in C++? This list will help you:

	Project	Stars
1	llama.cpp	56,891
2	LocalAI	19,593
3	PowerInfer	6,942
4	nitro	1,572
5	LlamaGPTJ-chat	203
6	llama_cpp.rb	129
7	collider	117
8	pyllamacpp	59
9	llama-server-chat-terminal-client	10
10	llama-chat	7