-
serge
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
-
exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Serge made it really easy for me to get started, but it's all CPU-based.
MLC LLM looks like an easy option to use my AMD GPU.
Llama.cpp seems like it can use both CPU and GPU, but I haven't quite figured that out yet.
You can try Koboldcpp with CLblast from this repo: https://github.com/LostRuins/koboldcpp/releases It lets you offload several layers to the GPU, with a significant boost in prompt processing and inference speed.
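For reference, a koboldcpp launch with CLBlast offload looks roughly like this (the layer count and model filename are placeholders; tune --gpulayers to your VRAM):

```shell
# Sketch: run koboldcpp with CLBlast GPU offload.
# --useclblast takes an OpenCL platform ID and device ID (0 0 is usually the first GPU);
# --gpulayers sets how many transformer layers are offloaded to VRAM.
python koboldcpp.py --useclblast 0 0 --gpulayers 32 --model llama-13b.ggmlv3.q4_0.bin
```

With a 13B 4-bit model, more offloaded layers generally means faster generation, up to the point where VRAM runs out.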
As mentioned, exllama is the way to go. Once you install ROCm and the official ROCm PyTorch build, you're ready to go. A 16GB 6800 XT can run 13B 4-bit GPTQ models with full context. Spare_Side just posted a report with the same GPU.
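A rough sketch of that setup (the ROCm version tag, model path, and benchmark script invocation are assumptions; check the PyTorch and exllama repos for current instructions):

```shell
# Install the ROCm build of PyTorch (version tag is an example).
pip install torch --index-url https://download.pytorch.org/whl/rocm5.6

# Get exllama and its dependencies.
git clone https://github.com/turboderp/exllama
cd exllama
pip install -r requirements.txt

# Benchmark a 4-bit GPTQ model (path is a placeholder).
python test_benchmark_inference.py -d /path/to/llama-13b-4bit-gptq -p
```

This assumes the kernel-level ROCm stack is already installed and your user is in the `video`/`render` groups so PyTorch can see the GPU.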
Related posts
-
Meet Atom the GPT Assistant, an AI-powered Smart Home Assistant. It's like Google Assistant but with endless possibility of ChatGPT, it's like Siri but with extensibility of Open Source power.
-
LocalAI: OpenAI compatible API to run LLM models locally on consumer grade hardware!
-
chatgpt alternative
-
Show HN: LlamaGPT – Self-hosted, offline, private AI chatbot, powered by Llama 2
-
LeCun: Qualcomm working with Meta to run Llama-2 on mobile devices