YOLOv6: Redefine state-of-the-art for object detection

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • YOLOv6

    YOLOv6: a single-stage object detection framework dedicated to industrial applications.

    https://github.com/meituan/YOLOv6/blob/main/docs/About_namin...

    > P.S. We are contacting the authors of YOLO series about the naming of YOLOv6.

    You should ask _before_ publishing, not _after_.

    They claim it runs faster and is more accurate than YOLOv5, yet requires 3x as much computation (GFLOPs)? Something doesn't add up here.

    There is unbelievably little information about the architecture too. Unfortunately it's not in a format I can easily throw the cfg in as visualize it: https://gitlab.com/danbarry16/darknet-visual

    This appears to be on purpose to advertise DagsHub: https://dagshub.com/pricing

  • edgetpu

    Coral issue tracker (and legacy Edge TPU API source)

    Is this available for https://coral.ai/ somehow? Would it be difficult to convert it?

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • yolov7_d2

    🔥🔥🔥🔥 (Earlier YOLOv7 not official one) YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥

  • edgetpu-yolo

    Minimal-dependency Yolov5 export and inference demonstration for the Google Coral EdgeTPU

  • https://github.com/meituan/YOLOv6/blob/main/docs/About_namin...

    > P.S. We are contacting the authors of YOLO series about the naming of YOLOv6.

    You should ask _before_ publishing, not _after_.

    They claim it runs faster and is more accurate than YOLOv5, yet requires 3x as much computation (GFLOPs)? Something doesn't add up here.

    There is unbelievably little information about the architecture too. Unfortunately it's not in a format I can easily throw the cfg in as visualize it: https://gitlab.com/danbarry16/darknet-visual

    This appears to be on purpose to advertise DagsHub: https://dagshub.com/pricing

  • yolact

    A simple, fully convolutional model for real-time instance segmentation.

  • PixelLib

    Visit PixelLib's official documentation https://pixellib.readthedocs.io/en/latest/

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • CATNet

    🛰️ Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images (TNNLS 2023)

  • CLIP

    CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

    Thanks. The next step would be combining it with text-image foundation models such as clip https://github.com/openai/CLIP (right?)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts