[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Why do you think that https://github.com/SqueezeAILab/LLMCompiler is a good alternative to SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Why do you think that https://github.com/SqueezeAILab/LLMCompiler is a good alternative to SqueezeLLM