A model compression and acceleration toolbox based on pytorch.
Why do you think that https://github.com/AlpinDale/sparsegpt-for-LLaMA is a good alternative to Sparsebit
A model compression and acceleration toolbox based on pytorch.
Why do you think that https://github.com/AlpinDale/sparsegpt-for-LLaMA is a good alternative to Sparsebit