What's the most basic NVIDIA graphics card that will work with mainstream 7B GPU models?

This page summarizes the projects mentioned and recommended in the original post on /r/Oobabooga.

  • alpaca-electron

    The simplest way to run Alpaca (and other LLaMA-based local LLMs) on your own computer

  • mlc-llm

    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

  • 4GB of VRAM is enough to run a 7B model. Try https://github.com/mlc-ai/mlc-llm, which uses Vulkan instead of CUDA. Their converted 7B model runs on a 4GB GTX card for me, and it's pretty speedy too.

  • llama.cpp

    LLM inference in C/C++

  • They work great for me. No need to get fancy: the easiest thing is to use the distribution from the person who wrote the code that everyone else is using. It's at https://github.com/ggerganov/llama.cpp. I use it with everything from 7B to 65B models.
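
The "4GB of VRAM is enough to run a 7B model" comment can be sanity-checked with some rough arithmetic. The sketch below is only an estimate of the memory taken by the weights themselves. The comment does not say how the converted model is quantized, so the bit widths used here are assumptions, and `weight_memory_gib` is a hypothetical helper, not part of mlc-llm or llama.cpp. It ignores the KV cache, activations, and runtime overhead, so real usage will be somewhat higher.

```python
def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB.

    Hypothetical helper for illustration only. It does not account for
    the KV cache, activations, or framework overhead.
    """
    bytes_total = n_params * bits_per_weight / 8
    return bytes_total / 2**30


if __name__ == "__main__":
    # Assumed bit widths; the original comment does not state which one was used.
    for bits in (16, 8, 4):
        print(f"7B at {bits}-bit: {weight_memory_gib(7e9, bits):.2f} GiB")
    # 7B at 16-bit: 13.04 GiB
    # 7B at 8-bit: 6.52 GiB
    # 7B at 4-bit: 3.26 GiB
```

Under these assumptions, only the roughly 4-bit case leaves any headroom on a 4GB card, which would be consistent with the comment if the converted model is quantized to about that level.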

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Ai on a android phone?

    2 projects | /r/LocalLLaMA | 8 Dec 2023
  • MLC vs llama.cpp

    2 projects | /r/LocalLLaMA | 7 Nov 2023
  • [Project] Scaling LLama2 70B with Multi NVIDIA and AMD GPUs under 3k budget

    1 project | /r/LocalLLaMA | 21 Oct 2023
  • Scaling LLama2-70B with Multi Nvidia/AMD GPU

    2 projects | news.ycombinator.com | 19 Oct 2023
  • ROCm Is AMD's #1 Priority, Executive Says

    5 projects | news.ycombinator.com | 26 Sep 2023