[D] Extracting the class labels and bounding boxes for objects, from a YOLO7 model after converting to an ONNX model

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

yolov7

33 12,681 4.0 Jupyter Notebook

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

(Please note, this is a re-post of my original question here, I think this subreddit might be more appropriate for asking this question)At work, we use Unity, we have a project that needs object detection and classification. We decided to use this YOLO7 model (for non-technical reasons, It had to be the exact same model as the company does have pre-trained weights for this exact model). However, Unity only supports ONNX so I exported the model as an ONNX model, using the code provided in the repo:

barracuda-release

1 551 3.6 C#

I removed --end2end from the export flags, which removed the warning, and for the Resize warning I followed the advice on a similar issue

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
ONNX-YOLOv7-Object-Detection

2 182 0.0 Python

Python scripts performing object detection using the YOLOv7 model in ONNX.

Finally, I tried to look if someone has done similar work for the ONNX model and I found this repo which links the same repo I am trying to use, and I believe this function is doing exactly what I want to do, but I could not understand what it is doing (I don't understand how it knows exactly where the number of detections is, and where the bounding boxes are and the class labels, etc.) furthermore, I am not sure if removing end2end and the changing the version from 12 to 9 has any effect on the output shape or it has to do with the internal layers.

yolov5

129 46,738 8.9 Python

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Those dimensions suggest you need to apply (i.e. roll your own) non-max suppresion to the outputs: relevant link

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project