Model-Compression-Research-Package
aimet
Model-Compression-Research-Package | aimet | |
---|---|---|
1 | 2 | |
133 | 1,943 | |
1.5% | 4.2% | |
5.3 | 9.6 | |
4 months ago | 4 days ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Model-Compression-Research-Package
-
I was looking for some great quantization open-source libraries that could actually be applied in production (both edge or cloud CPU/GPU). Do you know if I am missing any good libraries?
Intel Labs compression | Researching neural networks compression and acceleration methods.
aimet
-
I was looking for some great quantization open-source libraries that could actually be applied in production (both edge or cloud CPU/GPU). Do you know if I am missing any good libraries?
Qualcomm AIMET | Advanced quantization and compression techniques for trained neural network models
-
Model/Tool to use on Jetson for efficient Quantization/Pruning
Qualcomm AIMET may help you
What are some alternatives?
nebuly - The user analytics platform for LLMs
tkDNN - Deep neural network library and toolkit to do high performace inference on NVIDIA jetson platforms
TensorRT - NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
ludwig - Low-code framework for building custom LLMs, neural networks, and other AI models
open-lpr - Open Source and Free License Plate Recognition Software
model-optimization - A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
qkeras - QKeras: a quantization deep learning library for Tensorflow Keras
distiller - Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
orion - Asynchronous Distributed Hyperparameter Optimization.
elasticsearch-stress-test - Stress test tool for Elasticsearch