Better GPTQ-quantized LLaMA soon? Paper authors reach out to improve quantization code

GPTQ-for-LLaMa

75 2,913 8.6 Python

4 bits quantization of LLaMA using GPTQ

text-generation-webui

876 35,862 9.9 Python

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

qwop seems to have already implemented the changes in his repository, and the code for reading the new quantized models is ready here: https://github.com/oobabooga/text-generation-webui/pull/530

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA. Post date: 23 Mar 2023
