alpaca-electron
The simplest way to run Alpaca (and other LLaMA-based local LLMs) on your own computer
4GB of VRAM is enough to run a 7B model. Try this: https://github.com/mlc-ai/mlc-llm. It uses Vulkan instead of CUDA. Their converted 7B model runs on a 4GB GTX card for me, and it's pretty speedy too.
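For reference, here is a minimal sketch of driving mlc-llm from its newer Python package (`mlc_llm` with the `MLCEngine` OpenAI-style API); the model string is one of the prebuilt quantized weights and is illustrative, and the backend (Vulkan, CUDA, Metal) is selected by the compiled model rather than this code:

```python
# Minimal sketch, assuming the mlc_llm Python package is installed and
# prebuilt quantized weights are available (model name is illustrative).
from mlc_llm import MLCEngine

model = "HF://mlc-ai/Llama-2-7b-chat-hf-q4f16_1-MLC"
engine = MLCEngine(model)

# Stream a chat completion through the OpenAI-compatible interface.
for response in engine.chat.completions.create(
    messages=[{"role": "user", "content": "Hello! What can you do?"}],
    model=model,
    stream=True,
):
    for choice in response.choices:
        print(choice.delta.content, end="", flush=True)
print()

engine.terminate()
```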
They work great for me. No need to get fancy: the easiest option is to use the original project from the person who wrote the code everyone else builds on, https://github.com/ggerganov/llama.cpp. I use it with everything from 7B to 65B models.
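If you'd rather call it from a script than from the llama.cpp CLI, here is a minimal sketch using the llama-cpp-python bindings (a separate project that wraps llama.cpp); the model path and generation parameters are illustrative, and you'd point it at whichever quantized model file you downloaded:

```python
# Minimal sketch, assuming the llama-cpp-python package and a local
# quantized model file (path is illustrative, not from the original post).
from llama_cpp import Llama

llm = Llama(
    model_path="./models/7B/ggml-model-q4_0.gguf",  # any 7B-65B quantized model
    n_ctx=2048,                                     # context window size
)

# Simple one-shot completion; stop sequence keeps it from rambling.
out = llm(
    "Q: Name the planets in the solar system. A:",
    max_tokens=64,
    stop=["Q:", "\n\n"],
)
print(out["choices"][0]["text"].strip())
```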