I was looking for some great quantization open-source libraries that could actually be applied in production (both edge or cloud CPU/GPU). Do you know if I am missing any good libraries?

This page summarizes the projects mentioned and recommended in the original post on /r/learnmachinelearning

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. TensorRT

    NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

    Nvidia Quantization | Quantization with TensorRT

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. optimate

    A collection of libraries to optimise AI model performances

    Nebullvm | Easy-to-use library to boost AI inference leveraging state-of-the-art optimization techniques

  4. aimet

    AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

    Qualcomm AIMET | Advanced quantization and compression techniques for trained neural network models

  5. Model-Compression-Research-Package

    A library for researching neural networks compression and acceleration methods.

    Intel Labs compression | Researching neural networks compression and acceleration methods.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • AMD MI300X 30% higher performance than Nvidia H100, even with optimized stack

    1 project | news.ycombinator.com | 17 Dec 2023
  • Getting SDXL-turbo running with tensorRT

    1 project | /r/StableDiffusion | 6 Dec 2023
  • Show HN: Ollama for Linux – Run LLMs on Linux with GPU Acceleration

    14 projects | news.ycombinator.com | 26 Sep 2023
  • Train Your AI Model Once and Deploy on Any Cloud

    3 projects | news.ycombinator.com | 8 Jul 2023
  • A1111 just added support for TensorRT for webui as an extension!

    5 projects | /r/StableDiffusion | 27 May 2023

Did you know that Python is
the 2nd most popular programming language
based on number of references?