SqueezeLLM Alternatives
Similar projects and alternatives to SqueezeLLM
-
Qwen-7B
Discontinued. The official repo of Qwen (通义千问), the chat & pretrained large language model proposed by Alibaba Cloud. [Moved to: https://github.com/QwenLM/Qwen]
-
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
-
Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
SqueezeLLM reviews and mentions
-
Llama33B vs Falcon40B vs MPT30B
Using the currently popular GPTQ, 3-bit quantization hurts performance much more than 4-bit, but there are also AWQ (https://github.com/mit-han-lab/llm-awq) and SqueezeLLM (https://github.com/SqueezeAILab/SqueezeLLM), which manage 3-bit without as much of a performance drop - I hope to see them used more commonly.
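For context on why the bit width matters so much, here is a rough back-of-the-envelope calculation of weight memory for a 33B-parameter model; this is a sketch that ignores embeddings, activations, and quantization metadata such as scales and outlier storage:

```python
# Approximate weight memory for an LLM at various bit widths.
# Ignores embeddings, activations, and quantization metadata overhead.
def weight_gib(params: float, bits: int) -> float:
    """Bytes of weight storage converted to GiB."""
    return params * bits / 8 / 2**30

params = 33e9  # a 33B-parameter model
for bits in (16, 4, 3):
    print(f"{bits:>2}-bit: {weight_gib(params, bits):.1f} GiB")
```

Going from 4-bit to 3-bit shaves off another quarter of the weight memory, which is why a 3-bit method that preserves quality is attractive for fitting large models on consumer GPUs.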
-
Has anyone tried out Squeezellm?
[Paper][Github][Model]
- SqueezeLLM: Dense-and-Sparse Quantization
- New quantization method SqueezeLLM allows for lossless compression at 3-bit and outperforms GPTQ and AWQ at both 3-bit and 4-bit. Quantized Vicuna and LLaMA models have been released.
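The "dense-and-sparse" in the paper title describes the core idea: keep a small fraction of outlier weights in full precision as a sparse matrix and quantize the dense remainder to a low bit width. A toy NumPy sketch of that decomposition follows; the real method uses sensitivity-weighted non-uniform codebooks rather than the uniform grid shown here, and `outlier_frac` is an illustrative parameter, not one from the paper:

```python
import numpy as np

def dense_and_sparse_quantize(W, bits=3, outlier_frac=0.005):
    """Toy dense-and-sparse decomposition: keep large-magnitude outlier
    weights exact (the "sparse" part) and quantize the rest to a uniform
    2**bits-level grid (the "dense" part)."""
    W = W.astype(np.float32)
    # Treat the top outlier_frac largest-magnitude weights as outliers.
    k = max(1, int(outlier_frac * W.size))
    thresh = np.partition(np.abs(W).ravel(), -k)[-k]
    outlier_mask = np.abs(W) >= thresh
    # Dense remainder with outliers zeroed out.
    dense = np.where(outlier_mask, 0.0, W)
    # Uniform quantization of the dense remainder.
    levels = 2 ** bits
    lo, hi = dense.min(), dense.max()
    scale = (hi - lo) / (levels - 1) if hi > lo else 1.0
    dense_hat = np.round((dense - lo) / scale) * scale + lo
    # Reconstruction: exact outliers + dequantized dense part.
    return np.where(outlier_mask, W, dense_hat)

rng = np.random.default_rng(0)
W = rng.normal(size=(64, 64)).astype(np.float32)
W_hat = dense_and_sparse_quantize(W, bits=3)
print("max abs error:", np.abs(W - W_hat).max())
```

Pulling the outliers out first is what makes the low-bit grid workable: without them, a handful of extreme weights would stretch the quantization range and waste most of the 8 levels.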
-
Stats
SqueezeAILab/SqueezeLLM is an open-source project licensed under the MIT License, an OSI-approved license.
The primary programming language of SqueezeLLM is Python.