FQ-ViT vs neural-compressor

FQ-ViT

[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer (by megvii-research)

Source Code

Suggest alternative

Edit details

neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime (by intel)

low-precision pruning sparsity auto-tuning knowledge-distillation quantization quantization-aware-training post-training-quantization Deep Learning smoothquant

Source Code

intel.github.io

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

FQ-ViT		neural-compressor
	Project
2	Mentions	3
263	Stars	1,971
0.4%	Growth	4.4%
1.1	Activity	9.8
about 1 year ago	Latest Commit	3 days ago
Python	Language	Python
Apache License 2.0	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

FQ-ViT

Posts with mentions or reviews of FQ-ViT. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-06-20.

How to quantize a Swin transformer model?
3 projects | /r/deeplearning | 20 Jun 2022

This my implementation on the approach I shared( https://github.com/megvii-research/FQ-ViT ) on a small dataset from kaggle(link: https://www.kaggle.com/datasets/gauravduttakiit/ants-bees) in this notebook :https://colab.research.google.com/drive/1cqnmosPIVZu3e2SwbO_VbevANk5MppVS?usp=sharing

neural-compressor

Posts with mentions or reviews of neural-compressor. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-07-26.

Intel Textual Inversion Training on Hugging Face
1 project | /r/StableDiffusion | 22 Dec 2022
An open-source library for optimizing deep learning inference. (1) You select the target optimization, (2) nebullvm searches for the best optimization techniques for your model-hardware configuration, and then (3) serves an optimized model that runs much faster in inference
10 projects | /r/learnmachinelearning | 26 Jul 2022

Open-source projects leveraged by nebullvm include OpenVINO, TensorRT, Intel Neural Compressor, SparseML and DeepSparse, Apache TVM, ONNX Runtime, TFlite and XLA. A huge thank you to the open-source community for developing and maintaining these amazing projects.
Meet Intel® Neural Compressor: An Open-Source Python Library for Model Compression that Reduces the Model Size and Increases the Speed of Deep Learning Inference for Deployment on CPUs or GPUs
1 project | /r/Python | 18 Jul 2022

Continue reading | The Github repo for the library can be accessed here.

What are some alternatives?

When comparing FQ-ViT and neural-compressor you can also consider the following projects:

Efficient-AI-Backbones - Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

openvino - OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

Sparsebit - A model compression and acceleration toolbox based on pytorch.

tflite-micro - Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digital signal processors).

transformer-quantization

mmrazor - OpenMMLab Model Compression Toolbox and Benchmark.

nebuly - The user analytics platform for LLMs

TensorRT - NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

tvm - Open deep learning compiler stack for cpu, gpu and specialized accelerators

Lion - Code for "Lion: Adversarial Distillation of Proprietary Large Language Models (EMNLP 2023)"

sparseml - Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

deepsparse - Sparsity-aware deep learning inference runtime for CPUs

FQ-ViT vs Efficient-AI-Backbones neural-compressor vs openvino FQ-ViT vs Sparsebit neural-compressor vs tflite-micro FQ-ViT vs transformer-quantization neural-compressor vs mmrazor neural-compressor vs nebuly neural-compressor vs TensorRT neural-compressor vs tvm neural-compressor vs Lion neural-compressor vs sparseml neural-compressor vs deepsparse

Compare FQ-ViT vs neural-compressor and see what are their differences.

FQ-ViT

neural-compressor

FQ-ViT

neural-compressor

What are some alternatives?