Llama_cpp.rb Alternatives

Similar projects and alternatives to llama_cpp.rb

llama.cpp

769 56,891 10.0 C++ llama_cpp.rb VS llama.cpp

LLM inference in C/C++
whisper.cpp

187 31,174 9.8 C llama_cpp.rb VS whisper.cpp

Port of OpenAI's Whisper model in C/C++
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
mlc-llm

89 16,955 9.9 Python llama_cpp.rb VS mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
FastChat

82 33,877 9.6 Python llama_cpp.rb VS FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
LocalAI

82 19,593 9.9 C++ llama_cpp.rb VS LocalAI

:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.
exllama

64 2,582 9.0 Python llama_cpp.rb VS exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
llama-cpp-python

54 6,378 9.9 Python llama_cpp.rb VS llama-cpp-python

Python bindings for llama.cpp
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
lit-llama

23 5,789 8.4 Python llama_cpp.rb VS lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
darknet

22 25,292 0.0 C llama_cpp.rb VS darknet

Convolutional Neural Networks
go-llama.cpp

4 554 8.4 C++ llama_cpp.rb VS go-llama.cpp

LLama.cpp golang bindings
serving

12 6,071 9.8 C++ llama_cpp.rb VS serving

A flexible, high-performance serving system for machine learning models
flake

5 593 4.4 Nix llama_cpp.rb VS flake

A Nix flake for many AI projects
LLamaSharp

3 1,871 9.8 C# llama_cpp.rb VS LLamaSharp

A C#/.NET library to run LLM models (🦙LLaMA/LLaVA) on your local device efficiently.
llama.cpp-dotnet

1 48 9.4 C# llama_cpp.rb VS llama.cpp-dotnet

Minimal C# bindings for llama.cpp + .NET core library with API host/client.
llama-cpp.el

1 18 7.8 Emacs Lisp llama_cpp.rb VS llama-cpp.el

A client for llama-cpp server
llama-go

1 149 6.8 C llama_cpp.rb VS llama-go

Port of Facebook's LLaMA (Large Language Model Meta AI) in Golang with embedded C/C++
TokenHawk

1 98 10.0 C++ llama_cpp.rb VS TokenHawk

Discontinued WebGPU LLM inference tuned by hand [Moved to: https://github.com/kayvr/token-hawk]
llama.cpp

1 4 9.4 C llama_cpp.rb VS llama.cpp

Port of Facebook's LLaMA model in C/C++ (by SlyEcho)
llama-node

2 847 8.6 Rust llama_cpp.rb VS llama-node

Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better llama_cpp.rb alternative or higher similarity.

Suggest an alternative to llama_cpp.rb

llama_cpp.rb reviews and mentions

Posts with mentions or reviews of llama_cpp.rb. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-12.

Llama.cpp: Full CUDA GPU Acceleration
14 projects | news.ycombinator.com | 12 Jun 2023

Python sits on the C-glue segment of programming languages (where Perl, PHP, Ruby and Node are also notable members). Being a glue language means having APIs to a lot of external toolchains written in not only C/C++ but many other compiled languages, APIs and system resources. Conda, virtualenv, etc. are godsend modules for making it all work, or even better, to freeze things once they all work, without resourcing to Docker, VMs or shell scripts. It's meant for application and DevOps people who need to slap together, ie, ML, Numpy, Elasticsearch, AWS APIs and REST endpoints and Get $hit Done.
It's annoying to see them "glueys" compared to the binary compiled segment where the heavy lifting is done. Python and others exist to latch on and assimilate. Resistance is futile:
https://pypi.org/project/pyllamacpp/
https://www.npmjs.com/package/llama-node
https://packagist.org/packages/kambo/llama-cpp-php
https://github.com/yoshoku/llama_cpp.rb
Could I get a suggestion for a simple HTTP API with no GUI for llama.cpp?
8 projects | /r/LocalLLaMA | 16 May 2023

Ruby: yoshoku/llama_cpp.rb