aimet
Model-Compression-Research-Package
aimet | Model-Compression-Research-Package | |
---|---|---|
2 | 1 | |
1,911 | 133 | |
2.6% | 1.5% | |
9.6 | 5.3 | |
4 days ago | 3 months ago | |
Python | Python | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
aimet
-
I was looking for some great quantization open-source libraries that could actually be applied in production (both edge or cloud CPU/GPU). Do you know if I am missing any good libraries?
Qualcomm AIMET | Advanced quantization and compression techniques for trained neural network models
-
Model/Tool to use on Jetson for efficient Quantization/Pruning
Qualcomm AIMET may help you
Model-Compression-Research-Package
-
I was looking for some great quantization open-source libraries that could actually be applied in production (both edge or cloud CPU/GPU). Do you know if I am missing any good libraries?
Intel Labs compression | Researching neural networks compression and acceleration methods.
What are some alternatives?
tkDNN - Deep neural network library and toolkit to do high performace inference on NVIDIA jetson platforms
nebuly - The user analytics platform for LLMs
ludwig - Low-code framework for building custom LLMs, neural networks, and other AI models
TensorRT - NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
open-lpr - Open Source and Free License Plate Recognition Software
model-optimization - A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
qkeras - QKeras: a quantization deep learning library for Tensorflow Keras
distiller - Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
orion - Asynchronous Distributed Hyperparameter Optimization.
elasticsearch-stress-test - Stress test tool for Elasticsearch