TensorRT VS examples

Compare TensorRT vs examples and see what are their differences.

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT. (by NVIDIA)
Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
TensorRT examples
22 142
8,891 7,699
3.6% 1.2%
6.1 6.2
about 2 months ago 7 days ago
C++ Jupyter Notebook
Apache License 2.0 Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

TensorRT

Posts with mentions or reviews of TensorRT. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-26.

examples

Posts with mentions or reviews of examples. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-19.

What are some alternatives?

When comparing TensorRT and examples you can also consider the following projects:

DeepSpeed - DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

FasterTransformer - Transformer related optimization, including BERT, GPT

onnx-tensorrt - ONNX-TensorRT: TensorRT backend for ONNX

vllm - A high-throughput and memory-efficient inference and serving engine for LLMs

stable-diffusion-webui - Stable Diffusion web UI

openvino - OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

flash-attention - Fast and memory-efficient exact attention

tvm - Open deep learning compiler stack for cpu, gpu and specialized accelerators

tensorrtx - Implementation of popular deep learning networks with TensorRT network definition API

llama.cpp - LLM inference in C/C++

whisper - Robust Speech Recognition via Large-Scale Weak Supervision

whisper.cpp - Port of OpenAI's Whisper model in C/C++