AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. (by casper-hansen)
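To make "4-bit quantization" concrete, here is a minimal sketch of plain round-to-nearest quantization of one group of weights, written in pure Python. It is not AutoAWQ's code and not the full AWQ algorithm: AWQ additionally rescales salient weight channels using activation statistics before quantizing, and that step is omitted. The function names `quantize_group`, `dequantize_group` and `weight_memory_gb` are hypothetical and chosen only for illustration.

```python
# Illustrative only: round-to-nearest 4-bit quantization of one weight group.
# AWQ's activation-aware channel scaling is NOT implemented here.

def quantize_group(weights, n_bits=4):
    """Map a group of floats to unsigned n-bit integers plus (scale, zero_point)."""
    q_max = 2 ** n_bits - 1                      # 15 for 4-bit
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / q_max or 1.0       # fall back to 1.0 for a constant group
    zero_point = round(-w_min / scale)           # integer that represents 0.0
    q = [min(q_max, max(0, round(w / scale) + zero_point)) for w in weights]
    return q, scale, zero_point


def dequantize_group(q, scale, zero_point):
    """Recover approximate float weights from the quantized integers."""
    return [(v - zero_point) * scale for v in q]


def weight_memory_gb(n_params, bits_per_weight):
    """Rough weight storage in GB, ignoring per-group scales and zero points."""
    return n_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    group = [-0.8, -0.3, 0.0, 0.25, 0.7, 0.1, -0.55, 0.4]
    q, scale, zp = quantize_group(group)
    restored = dequantize_group(q, scale, zp)
    print("quantized:", q)
    print("max abs error:", max(abs(a - b) for a, b in zip(group, restored)))
    # A 7B-parameter model: 16-bit vs 4-bit weight storage (weights only).
    print(weight_memory_gb(7e9, 16), "GB vs", weight_memory_gb(7e9, 4), "GB")
```

Each weight is stored as an integer in the range 0 to 15, and the reconstruction error is on the order of one quantization step (`scale`). The storage estimate counts weights only; real 4-bit checkpoints are somewhat larger because each group also stores its scale and zero point.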


AutoAWQ reviews and mentions

Posts with mentions or reviews of AutoAWQ. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-23.
  • AMD ROCm Software Blogs
    4 projects | news.ycombinator.com | 23 Feb 2024
    Thanks! Again, partnerships over customers. If you're experienced and have the technical chops to make a MI300x sing, we want to work with you. Our model is that we are the capex/opex investor for businesses. As much as I love software, Hot Aisle is more of a hardware business. Running super high end large scale compute is an extreme challenge in itself. We are less interested in building the software side of things and want to foster those who can focus on that side.

    https://github.com/unslothai/unsloth/issues/160

    https://github.com/search?q=repo%3Apredibase%2Florax+rocm&ty...

    https://github.com/sgl-project/sglang/issues/157

    https://github.com/casper-hansen/AutoAWQ (supports rocm)

  • 1,200 tokens per second for Llama 2 7B on H100!
    2 projects | /r/LocalLLaMA | 6 Dec 2023
    Apparently you can reach about 800 t/s on an RTX 4090 using AutoAWQ with batch size 8. https://github.com/casper-hansen/AutoAWQ

Stats

Basic AutoAWQ repo stats
Mentions: 2
Stars: 1,242
Activity: 9.5
Last commit: 6 days ago

casper-hansen/AutoAWQ is an open source project licensed under the MIT License, which is an OSI-approved license.

The primary programming language of AutoAWQ is Python.

