TensorRT VS onnx-tensorrt

Compare TensorRT and onnx-tensorrt and see how they differ.

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT. (by NVIDIA)

onnx-tensorrt

ONNX-TensorRT: TensorRT backend for ONNX (by onnx)
               TensorRT              onnx-tensorrt
Mentions       22                    4
Stars          9,031                 2,745
Growth         3.6%                  2.0%
Activity       5.0                   4.1
Last commit    13 days ago           12 days ago
Language       C++                   C++
License        Apache License 2.0    Apache License 2.0
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

TensorRT

Posts with mentions or reviews of TensorRT. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-26.

onnx-tensorrt

Posts with mentions or reviews of onnx-tensorrt. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-06-02.
  • [P] [D] How to get TensorFlow model to run on Jetson Nano?
    4 projects | /r/MachineLearning | 2 Jun 2021
    Conversion was done from Keras/TensorFlow to ONNX using https://github.com/onnx/keras-onnx, followed by ONNX to TensorRT using https://github.com/onnx/onnx-tensorrt. The Python code used for inference with TensorRT can be found at https://github.com/jonnor/modeld/blob/tensorrt/tensorrtutils.py
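
The two conversion steps mentioned in the post above might look roughly like the following. This is a minimal sketch, not the poster's actual code: it assumes the keras2onnx and onnx Python packages are installed and that the onnx2trt executable built from onnx-tensorrt is on PATH. The function names and the file names model.onnx and model.trt are hypothetical.

```python
import shutil
import subprocess


def keras_to_onnx(model, onnx_path):
    """Step 1: export a Keras model to an ONNX file.

    Assumes the third-party keras2onnx and onnx packages are installed;
    they are imported lazily so the rest of this sketch runs without them.
    """
    import keras2onnx
    import onnx

    onnx_model = keras2onnx.convert_keras(model, model.name)
    onnx.save_model(onnx_model, onnx_path)


def build_onnx2trt_command(onnx_path, engine_path):
    """Step 2: build the argv for the onnx2trt tool from onnx-tensorrt."""
    return ["onnx2trt", onnx_path, "-o", engine_path]


def onnx_to_tensorrt(onnx_path, engine_path):
    """Run onnx2trt if it is on PATH; return False if the tool is missing."""
    command = build_onnx2trt_command(onnx_path, engine_path)
    if shutil.which(command[0]) is None:
        return False  # tool not installed, so nothing was converted
    subprocess.run(command, check=True)  # raises if the conversion fails
    return True


if __name__ == "__main__":
    # Hypothetical file names, printed only to show the resulting command.
    print(" ".join(build_onnx2trt_command("model.onnx", "model.trt")))
```

The resulting engine file has to be built for the device that will run it (a Jetson Nano in the post), so the second step would normally be run on the target hardware.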

What are some alternatives?

When comparing TensorRT and onnx-tensorrt you can also consider the following projects:

DeepSpeed - DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

FasterTransformer - Transformer related optimization, including BERT, GPT

onnxruntime - ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

vllm - A high-throughput and memory-efficient inference and serving engine for LLMs

openvino - OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

stable-diffusion-webui - Stable Diffusion web UI

flash-attention - Fast and memory-efficient exact attention

jetson-inference - Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

tvm - Open deep learning compiler stack for cpu, gpu and specialized accelerators

tensorrtx - Implementation of popular deep learning networks with TensorRT network definition API

llama.cpp - LLM inference in C/C++

whisper - Robust Speech Recognition via Large-Scale Weak Supervision