Python model-compression

Open-source Python projects categorized as model-compression

Top 17 Python model-compression Projects

  1. Efficient-AI-Backbones

    Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

  2. Torch-Pruning

    [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

  3. Pretrained-Language-Model

    Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

  4. model-optimization

    A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

    Project mention: 95% Accurate Wake Word Detection: Low-Power CNN + MFCC Guide | dev.to | 2025-10-19

    TensorFlow Model Optimization Toolkit

  5. DeepCache

    [CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

  6. SqueezeLLM

    [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

  7. archai

    Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.

  8. KVQuant

    [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

  9. q-diffusion

    [ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

  10. picollm

    On-device LLM Inference Powered by X-Bit Quantization

  11. only_train_once_personal_footprint

    OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM

  12. SVD-LLM

    [ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2

  13. EuLLM

    Open-source platform for creating, distributing and running sovereign EU-compliant LLMs. Verticalize any model for your domain, language and brand. AI Act ready.

    Project mention: Show HN: Replacing cloud LLM APIs with local, domain-specific models | news.ycombinator.com | 2026-03-25

  14. MQAT

    [TMLR, 2024] Modular Quantization-Aware Training for 6D Object Pose Estimation

  15. SatQuant

    Fixing TFLite INT8 quantization for small objects in satellite imagery. A drop-in wrapper for focus-based calibration.

    Project mention: SatQuant: Fix YOLOv8 quantization accuracy on satellite imagery (Edge TPU) | news.ycombinator.com | 2025-11-28

  16. glq

    E8 lattice codebook quantization for LLM weights: 2/3/4 bpw with fused Triton inference kernel

    Project mention: Show HN: Glq LLM quantization using E8 lattice | news.ycombinator.com | 2026-06-01

  17. reap-mlx

    MLX-compatible REAP for pruning MoE models on Apple Silicon

    Project mention: Show HN: Ported Cerebras Reap to MLX – Prune Moe Experts on a MacBook | news.ycombinator.com | 2026-06-01
NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python model-compression related posts

  • CVPR Edition: Voxel51 Filtered Views Newsletter - June 21, 2024

    5 projects | dev.to | 21 Jun 2024
  • Llama33B vs Falcon40B vs MPT30B

    2 projects | /r/LocalLLaMA | 5 Jul 2023
  • [P] Help: I want to compress EfficientnetV2 using pruning.

    1 project | /r/MachineLearning | 28 Jun 2023
  • SqueezeLLM: Dense-and-Sparse Quantization

    1 project | news.ycombinator.com | 15 Jun 2023
  • New quantization method SqueezeLLM allows for lossless compression for 3-bit and outperforms GPTQ and AWQ in both 3-bit and 4-bit. Quantized Vicuna and LLaMA models have been released.

    2 projects | /r/LocalLLaMA | 14 Jun 2023
  • Researchers From China Introduce Vision GNN (ViG): A Graph Neural Network For Computer Vision Systems

    1 project | /r/machinelearningnews | 8 Jun 2022
  • GNN for computer vision, beating CNN & Transformer

    1 project | /r/deeplearning | 4 Jun 2022

Index

What are some of the best open-source model-compression projects in Python? This list will help you:

#   Project                             Stars
1   Efficient-AI-Backbones              4,417
2   Torch-Pruning                       3,313
3   Pretrained-Language-Model           3,158
4   model-optimization                  1,573
5   DeepCache                             964
6   SqueezeLLM                            718
7   archai                                485
8   KVQuant                               421
9   q-diffusion                           370
10  picollm                               311
11  only_train_once_personal_footprint    310
12  SVD-LLM                               295
13  EuLLM                                  25
14  MQAT                                    6
15  SatQuant                                3
16  glq                                     3
17  reap-mlx                                0
