- Torch-Pruning: [CVPR 2023] Towards Any Structural Pruning (LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs)
I also tried structured pruning from https://github.com/VainF/Torch-Pruning, since they report EfficientNetV2 as "prunable", but obtained much worse results. The advantage of this approach, however, is that it keeps the model dense, so you get a real speed-up on common GPUs, whereas unstructured pruning sparsifies the model and you need hardware that can exploit that sparsity.
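To make the dense-vs-sparse distinction concrete, here is a minimal NumPy sketch (not Torch-Pruning's actual API) of the two pruning styles on a single weight matrix. The channel count, sparsity level, and L2-norm importance criterion are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((8, 16))  # a dense layer: 8 output channels, 16 inputs

# Structured pruning: drop whole output channels (rows), here ranked by L2 norm.
# The surviving weights form a smaller *dense* matrix, so ordinary GPU kernels
# get a real speed-up without any special sparse support.
norms = np.linalg.norm(W, axis=1)
keep = np.sort(np.argsort(norms)[4:])  # keep the 4 highest-norm channels
W_structured = W[keep]
print(W_structured.shape)  # (4, 16): smaller dense matrix

# Unstructured pruning: zero roughly the smallest 50% of individual weights.
# The shape is unchanged; the zeros only pay off on sparsity-aware hardware.
thresh = np.median(np.abs(W))
W_unstructured = np.where(np.abs(W) >= thresh, W, 0.0)
print(W_unstructured.shape)  # (8, 16): same shape, just sparse
```

In real use, structured pruning also has to propagate each removed channel to every layer that consumes it (which is what Torch-Pruning's dependency graph automates); this sketch only shows the single-layer effect.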
NOTE: The number of mentions on this list reflects mentions in common posts plus user-suggested alternatives; a higher number therefore means a more popular project.
Related posts
- Llama33B vs Falcon40B vs MPT30B
- Has anyone tried out Squeezellm?
- SqueezeLLM: Dense-and-Sparse Quantization
- New quantization method SqueezeLLM allows for lossless compression at 3-bit and outperforms GPTQ and AWQ in both 3-bit and 4-bit. Quantized Vicuna and LLaMA models have been released.
- [R] 🤖🌟 Unlock the Power of Personal AI: Introducing ChatLLaMA, Your Custom Personal Assistant! 🚀💬