Single RTX 3080 or two RTX 3060s for deep learning inference?

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

server

24 7,356 9.5 Python

The Triton Inference Server provides an optimized cloud and edge inferencing solution. (by triton-inference-server)

For inference of CNNs, memory should really not be an issue. If it is a software engineering problem, not a hardware issue. FP16 or Int8 for weights is fine and weight size won’t increase due to the high resolution. And during inference memory used for hidden layer tensors can be reused as soon as the last consumer layer has been processed. You likely using something that is designed for training for inference and that blows up the memory requirement, or if you are using TensorRT or something like that, you need to be careful to avoid that every tasks loads their own copy of the library code into the GPU. Maybe look at https://github.com/triton-inference-server/server

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

"A matching Triton is not available"

1 project | /r/StableDiffusion | 15 Oct 2023
Triton Inference Server - Backend

2 projects | /r/learnmachinelearning | 13 Jun 2023
I mean,.. we COULD just make our own lol

4 projects | /r/replika | 12 Feb 2023
What are some essential tools to learn for MLE?

1 project | /r/learnmachinelearning | 19 Dec 2022
An Initial Look at Deep Learning IO Performance

1 project | /r/learnmachinelearning | 29 Nov 2022

Single RTX 3080 or two RTX 3060s for deep learning inference?

This page summarizes the projects mentioned and recommended in the original post on /r/computervision
Inference GPU Machine Learning Deep Learning Cloud
Post date: 12 Apr 2023

server

InfluxDB

Related posts

"A matching Triton is not available"

Triton Inference Server - Backend

I mean,.. we COULD just make our own lol

What are some essential tools to learn for MLE?

An Initial Look at Deep Learning IO Performance

Single RTX 3080 or two RTX 3060s for deep learning inference?

This page summarizes the projects mentioned and recommended in the original post on /r/computervision Inference GPU Machine Learning Deep Learning Cloud Post date: 12 Apr 2023

server

InfluxDB

Related posts

"A matching Triton is not available"

Triton Inference Server - Backend

I mean,.. we COULD just make our own lol

What are some essential tools to learn for MLE?

An Initial Look at Deep Learning IO Performance

This page summarizes the projects mentioned and recommended in the original post on /r/computervision
Inference GPU Machine Learning Deep Learning Cloud
Post date: 12 Apr 2023