llm-awq Alternatives
Similar projects and alternatives to llm-awq
-
FLiPStackWeekly
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
-
CodeGen
CodeGen is a family of open-source models for program synthesis, trained on TPU-v4 and competitive with OpenAI Codex.
-
amazon-bedrock-with-builder-and-command-patterns
A simple yet powerful implementation in Java that lets developers write rather straightforward code to create API requests for the different foundation models supported by Amazon Bedrock.
-
FLaNK-Halifax
Community over Code, Apache NiFi, Apache Kafka, Apache Flink, Python, GTFS, Transit, Open Source, Open Data
-
CoC2023
Community over Code, Apache NiFi, Apache Kafka, Apache Flink, Python, GTFS, Transit, Open Source, Open Data
-
kafka-streams-dashboards
Showcases Grafana dashboards for Kafka Streams applications, leveraging client JMX metrics.
-
nifiConcurrencyDuration
Search the NiFi config for an excessive concurrentlySchedulableTaskCount; if desired, update it to a lower value and increase the processor run duration.
llm-awq reviews and mentions
-
TinyChat: Large Language Model on the Edge
TinyChat is an efficient, lightweight, Python-native serving framework for 4-bit LLMs quantized with AWQ. It delivers a 2.3x generation speedup on an RTX 4090.
Code: https://github.com/mit-han-lab/llm-awq/tree/main/tinychat
- FLaNK Stack Weekly 23 Oct 2023
-
New base model InternLM 7B weights released, with 8k context window.
I am having trouble finding any 8-bit GPTQ models at all; there don't seem to be any on HF. It's almost all 4-bit, with the odd 3-bit version of the big ones. I suspect I will have to make my own for eval purposes, but that's lower priority on my list than finding a 4-bit model that's GPU friendly without such a performance penalty. Looking at AWQ, they have 3- and 4-bit versions.
-
Llama33B vs Falcon40B vs MPT30B
With the currently popular GPTQ, 3-bit quantization hurts performance much more than 4-bit, but there are also AWQ (https://github.com/mit-han-lab/llm-awq) and SqueezeLLM (https://github.com/SqueezeAILab/SqueezeLLM), which manage 3-bit without as much of a performance drop. I hope to see them used more commonly.
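The gap between 3-bit and 4-bit that this comment describes follows directly from the quantization step size. A minimal sketch (my own illustration, not code from any of the linked repos) of round-to-nearest group-wise weight quantization shows the reconstruction error roughly quadrupling when one bit is dropped:

```python
import numpy as np

def quantize_groupwise(w, n_bits, group_size=128):
    """Round-to-nearest uniform quantization with one (scale, zero-point)
    per group of weights; a simplified stand-in for GPTQ/AWQ-style schemes."""
    qmax = 2 ** n_bits - 1
    w = w.reshape(-1, group_size)
    lo = w.min(axis=1, keepdims=True)
    hi = w.max(axis=1, keepdims=True)
    scale = (hi - lo) / qmax
    q = np.round((w - lo) / scale)        # integer codes in [0, qmax]
    return (q * scale + lo).reshape(-1)   # dequantized weights

rng = np.random.default_rng(0)
w = rng.normal(size=4096).astype(np.float32)
for bits in (4, 3):
    mse = np.mean((w - quantize_groupwise(w, bits)) ** 2)
    print(f"{bits}-bit reconstruction MSE: {mse:.6f}")
```

Halving the number of levels doubles the step size, and the mean squared error grows with the square of the step, which is why naive 3-bit hurts so much more and why the 3-bit-capable methods above need cleverer strategies than plain rounding.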
- New hardware-friendly quantization method
-
Activation-Aware Weight Quantization for LLM Compression Outperforms GPTQ
Better quantization would have a direct and meaningful impact for everyone running local LLMs. The technique has already been applied to both Vicuna and the multimodal LLaMA variant LLaVA.
https://github.com/mit-han-lab/llm-awq
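The "activation-aware" idea can be sketched in a few lines of numpy. This is a hedged toy illustration of the principle described in the AWQ paper, not the repo's actual implementation: scale up the weight columns that see large activations before quantizing, and fold the inverse scale into the activations, which is an exact no-op in full precision but shifts quantization error away from the channels that matter. The salient-channel boost, the alpha=0.5 exponent, and all tensor shapes here are arbitrary choices for the demo:

```python
import numpy as np

def quantize_rtn(w, n_bits=4):
    """Per-output-row symmetric round-to-nearest quantization."""
    qmax = 2 ** (n_bits - 1) - 1
    scale = np.abs(w).max(axis=1, keepdims=True) / qmax
    return np.round(w / scale) * scale

rng = np.random.default_rng(1)
x = rng.normal(size=(256, 64)).astype(np.float32)
x[:, :4] *= 30.0                        # a few "salient" activation channels
W = rng.normal(size=(32, 64)).astype(np.float32)

y_ref = x @ W.T                          # full-precision reference output

# Plain 4-bit round-to-nearest quantization of the weights.
y_plain = x @ quantize_rtn(W).T

# Activation-aware: derive a per-input-channel scale from activation
# statistics, enlarge the corresponding weight columns before quantizing,
# and divide the activations by the same scale to keep the math equivalent.
s = np.abs(x).mean(axis=0) ** 0.5
y_awq = (x / s) @ quantize_rtn(W * s).T

mse_plain = np.mean((y_ref - y_plain) ** 2)
mse_awq = np.mean((y_ref - y_awq) ** 2)
print("plain RTN output MSE:     ", mse_plain)
print("activation-aware output MSE:", mse_awq)
```

The output error of the activation-aware variant comes out lower because the weight columns multiplying large activations are quantized with finer effective resolution; the real method additionally searches over the scaling exponent per layer.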
-
New quantization method AWQ outperforms GPTQ in 4-bit and 3-bit with 1.45x speedup and works with multimodal LLMs
GitHub: https://github.com/mit-han-lab/llm-awq
Stats
mit-han-lab/llm-awq is an open-source project licensed under the MIT License, an OSI-approved license.
The primary programming language of llm-awq is Python.