Best way to use AMD CPU and GPU

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA.

  • serge

    A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

  • Serge made it really easy for me to get started, but it is all CPU-based.

  • mlc-llm

    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

  • MLC LLM looks like an easy option to use my AMD GPU.

  • llama.cpp

    LLM inference in C/C++

  • Llama.cpp seems like it can use both CPU and GPU, but I haven't quite figured that out yet.

  • koboldcpp

    A simple one-file way to run various GGML and GGUF models with KoboldAI's UI

  • You can try Koboldcpp with CLBlast from this repo (https://github.com/LostRuins/koboldcpp/releases). It lets you offload several layers to the GPU, which significantly speeds up both prompt processing and inference.

  • exllama

    A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

  • As mentioned, exllama is the way to go. Once you install ROCm and the official ROCm build of PyTorch, you are ready to go. A 16GB 6800XT can run 13B 4-bit GPTQ models with full context. Spare_Side just posted a report with the same GPU.
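
The claim that a 16GB card fits a 13B 4-bit model with full context can be sanity-checked with some rough arithmetic. The sketch below is only a back-of-the-envelope estimate, not a measurement: the architecture numbers (40 layers, hidden size 5120), the fp16 key/value cache, and the context lengths of 2048 and 4096 tokens are assumptions that the post does not state, and the function names are made up for illustration. It also ignores quantization metadata, activations, and framework overhead, so real usage will be higher.

```python
# Rough VRAM estimate for a 4-bit quantized 13B Llama-style model.
# ASSUMPTIONS (not from the original post): 13e9 parameters, 40 layers,
# hidden size 5120, fp16 key/value cache. Overheads are ignored.

GIB = 1024 ** 3


def weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Bytes needed to store the quantized weights alone."""
    return n_params * bits_per_weight / 8


def kv_cache_bytes(n_layers: int, hidden_size: int, context_len: int,
                   bytes_per_value: int = 2) -> int:
    """Bytes for the key and value caches at a given context length."""
    # Two tensors (K and V) per layer, each context_len x hidden_size values.
    return 2 * n_layers * hidden_size * context_len * bytes_per_value


def estimate_gib(n_params: float, bits_per_weight: float, n_layers: int,
                 hidden_size: int, context_len: int) -> float:
    """Weights plus KV cache, expressed in GiB."""
    total = (weight_bytes(n_params, bits_per_weight)
             + kv_cache_bytes(n_layers, hidden_size, context_len))
    return total / GIB


if __name__ == "__main__":
    for ctx in (2048, 4096):  # hypothetical "full context" values
        gib = estimate_gib(13e9, 4, 40, 5120, ctx)
        print(f"context {ctx}: about {gib:.1f} GiB for weights + KV cache")
```

Under these assumptions the weights plus cache come to well under 16 GiB at either context length, which is consistent with the report above, though the remaining headroom has to cover everything the estimate leaves out.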

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Meet Atom the GPT Assistant, an AI-powered Smart Home Assistant. It's like Google Assistant but with endless possibility of ChatGPT, it's like Siri but with extensibility of Open Source power.

    6 projects | /r/homeautomation | 27 Apr 2023
  • LocalAI: OpenAI compatible API to run LLM models locally on consumer grade hardware!

    13 projects | /r/selfhosted | 23 Apr 2023
  • chatgpt alternative

    3 projects | /r/selfhosted | 8 Dec 2023
  • Show HN: LlamaGPT – Self-hosted, offline, private AI chatbot, powered by Llama 2

    12 projects | news.ycombinator.com | 16 Aug 2023
  • LeCun: Qualcomm working with Meta to run Llama-2 on mobile devices

    4 projects | news.ycombinator.com | 23 Jul 2023