I want to create a pill counter using points instead of bounding boxes. What model should I train from?

This page summarizes the projects mentioned and recommended in the original post on /r/learnmachinelearning

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • OpenCV

    Open Source Computer Vision Library

  • I've done similar jobs using openCV for counting cells in microscope, or cars in parking lots. It's a very straightforward approach.

  • mmdetection

    OpenMMLab Detection Toolbox and Benchmark

  • If you are really that lazy: use bboxes of the fixed size placed in the center of the pill. The pill does not have to fit into the box - modern architectures see the image as a whole, not only the crop in the box. For example if you would train detection on labels which are shifted (add 30px to each label coordinate), the network would learn to place each box 30px next to the actual object. So just let small box represent the center of the pill. The problem will arise if you will use improperly configured architecture, i.e. if you will not change the anchors in SSD model. Try efficientdet architecture implemented in mmdetection or, the easiest, yolov5. These should work out of the box.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • yolov5

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

  • If you are really that lazy: use bboxes of the fixed size placed in the center of the pill. The pill does not have to fit into the box - modern architectures see the image as a whole, not only the crop in the box. For example if you would train detection on labels which are shifted (add 30px to each label coordinate), the network would learn to place each box 30px next to the actual object. So just let small box represent the center of the pill. The problem will arise if you will use improperly configured architecture, i.e. if you will not change the anchors in SSD model. Try efficientdet architecture implemented in mmdetection or, the easiest, yolov5. These should work out of the box.

  • CenterNet

    Object detection, 3D detection, and pose estimation using center point detection:

  • Take a look at this centernet architecture.

  • DeepLabCut

    Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans

  • Well you could try DeepLabCut - https://github.com/DeepLabCut/DeepLabCut

  • DeepPoseKit

    a toolkit for pose estimation using deep learning

  • Or a variation called DeepPoseKit - https://github.com/jgraving/DeepPoseKit which hasn't been as updated as recently but is easier to batch / code.

  • pose-tensorflow

    Human Pose estimation with TensorFlow framework

  • Also DeepLabCut uses primarily videos. It's built on the stacked hourglass method from this repo: https://github.com/eldar/pose-tensorflow

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts