neural-compressor VS nebuly

Compare neural-compressor vs nebuly and see what their differences are.

                   neural-compressor     nebuly
Mentions           3                     105
Stars              1,950                 8,367
Growth             6.5%                  0.3%
Activity           9.8                   8.4
Last commit        4 days ago            6 months ago
Language           Python                Python
License            Apache License 2.0    Apache License 2.0
Mentions - the total number of mentions we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars a project has on GitHub. Growth - month-over-month growth in stars.
Activity - a relative measure of how actively a project is being developed; recent commits are weighted more heavily than older ones.
For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects we track.

neural-compressor

Posts with mentions or reviews of neural-compressor. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-07-26.

nebuly

Posts with mentions or reviews of nebuly. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-04.

What are some alternatives?

When comparing neural-compressor and nebuly you can also consider the following projects:

openvino - OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

tvm - Open deep learning compiler stack for CPU, GPU, and specialized accelerators

tflite-micro - Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digital signal processors).

AITemplate - AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

mmrazor - OpenMMLab Model Compression Toolbox and Benchmark.

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), and Llama models.

TensorRT - NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

alpaca-lora - Instruct-tune LLaMA on consumer hardware

Lion - Code for "Lion: Adversarial Distillation of Proprietary Large Language Models (EMNLP 2023)"

deepsparse - Sparsity-aware deep learning inference runtime for CPUs