Help needed with installing quant_cuda for the WebUI

GPTQ-for-LLaMa

75 2,913 8.6 Python

4 bits quantization of LLaMA using GPTQ

cd repositories
git clone https://github.com/qwopqwop200/GPTQ-for-LLaMa
pip install -r requirements.txt

GPTQ-for-LLaMa

19 129 7.7 Python

4 bits quantization of LLaMa using GPTQ (by oobabooga)

This worked for me on Ubuntu. If you want to use the CUDA branch instead of Triton, follow the same steps, but clone this GPTQ-for-LLaMa fork and run python setup_cuda.py install.
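
After running the install step above, a quick way to check whether it took effect is to ask Python if the extension can be found. This is only a minimal sketch: it assumes the module that setup_cuda.py installs is named quant_cuda (taken from the post title), and the helper name module_installed is made up for illustration. Run it with the same Python environment the WebUI uses.

```python
import importlib.util


def module_installed(name: str) -> bool:
    """Return True if a top-level module called `name` can be located."""
    # find_spec returns None when the module is not on the import path.
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # Assumption: "quant_cuda" is the module name installed by setup_cuda.py.
    if module_installed("quant_cuda"):
        print("quant_cuda found")
    else:
        print("quant_cuda not found; try re-running the install in this environment")
```

If the check reports that the module is missing even though the build finished, the install may have gone into a different Python environment than the one being used here.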

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA. Post date: 31 May 2023