[D] Extracting the class labels and bounding boxes for objects, from a YOLO7 model after converting to an ONNX model

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • yolov7

    Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

  • (Please note, this is a re-post of my original question here, I think this subreddit might be more appropriate for asking this question)At work, we use Unity, we have a project that needs object detection and classification. We decided to use this YOLO7 model (for non-technical reasons, It had to be the exact same model as the company does have pre-trained weights for this exact model). However, Unity only supports ONNX so I exported the model as an ONNX model, using the code provided in the repo:

  • barracuda-release

  • I removed --end2end from the export flags, which removed the warning, and for the Resize warning I followed the advice on a similar issue

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • ONNX-YOLOv7-Object-Detection

    Python scripts performing object detection using the YOLOv7 model in ONNX.

  • Finally, I tried to look if someone has done similar work for the ONNX model and I found this repo which links the same repo I am trying to use, and I believe this function is doing exactly what I want to do, but I could not understand what it is doing (I don't understand how it knows exactly where the number of detections is, and where the bounding boxes are and the class labels, etc.) furthermore, I am not sure if removing end2end and the changing the version from 12 to 9 has any effect on the output shape or it has to do with the internal layers.

  • yolov5

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

  • Those dimensions suggest you need to apply (i.e. roll your own) non-max suppresion to the outputs: relevant link

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts