I want to create a pill counter using points instead of bounding boxes. What model should I train from?

OpenCV

196 75,423 9.9 C++

Open Source Computer Vision Library

I've done similar jobs using OpenCV, such as counting cells in microscope images or cars in parking lots. It's a very straightforward approach.
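The comment does not say which OpenCV technique was used. A common classical approach, assumed here, is to threshold the image and count connected bright regions. The sketch below shows that idea in plain Python so it runs without dependencies; `count_blobs`, its parameters, and the toy image are hypothetical. In OpenCV itself, `cv2.threshold` followed by `cv2.connectedComponents` would play the same role.

```python
from collections import deque


def count_blobs(image, threshold=128, min_area=2):
    """Count bright connected regions in a grayscale image given as a list of rows."""
    height = len(image)
    width = len(image[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    count = 0
    for y in range(height):
        for x in range(width):
            if seen[y][x] or image[y][x] < threshold:
                continue
            # Flood-fill one blob (4-connectivity) and measure its area.
            area = 0
            queue = deque([(y, x)])
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                area += 1
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx] and image[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Ignore specks smaller than min_area (assumed to be noise).
            if area >= min_area:
                count += 1
    return count


# Toy 5x6 "image": two blobs plus one single-pixel speck.
image = [
    [0,   0,   0, 0,   0,   0],
    [0, 200, 210, 0,   0,   0],
    [0, 205, 220, 0, 180,   0],
    [0,   0,   0, 0, 190,   0],
    [0,   0,   0, 0,   0, 255],
]
print(count_blobs(image))
```

This only works when pills are well separated from each other and from the background; touching pills would merge into one blob and need an extra splitting step.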

mmdetection

23 27,742 8.7 Python

OpenMMLab Detection Toolbox and Benchmark

If you are really that lazy: use fixed-size bounding boxes centered on each pill. The pill does not have to fit inside the box, because modern architectures see the whole image, not only the crop inside the box. For example, if you trained a detector on shifted labels (say, 30 px added to each label coordinate), the network would learn to place each box 30 px away from the actual object. So just let a small box represent the center of the pill. Problems arise only with an improperly configured architecture, e.g. if you do not change the anchors in an SSD model. Try the EfficientDet architecture implemented in mmdetection or, easiest of all, yolov5. These should work out of the box.
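As a rough illustration of the fixed-size box idea, the sketch below turns point annotations into YOLO-style label lines (`class x_center y_center width height`, all normalized to the image size). The function name `points_to_yolo_labels`, the 32 px box size, and the sample points are assumptions for the example, not something from the comment.

```python
def points_to_yolo_labels(points, image_width, image_height, box_size=32, class_id=0):
    """Convert (x, y) pixel points into YOLO label lines with a fixed-size box."""
    lines = []
    for x, y in points:
        # YOLO labels are normalized: centers and sizes are fractions of the image.
        x_center = x / image_width
        y_center = y / image_height
        width = box_size / image_width
        height = box_size / image_height
        lines.append(
            f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"
        )
    return lines


# Hypothetical pill centers clicked on a 640x480 image.
points = [(320, 240), (100, 50)]
for line in points_to_yolo_labels(points, 640, 480):
    print(line)
```

Each image would get one label file containing these lines. Boxes that extend past the image border are not clipped here, so points very close to the edge may need separate handling.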

yolov5

129 46,738 8.9 Python

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

CenterNet

6 7,101 0.0 Python

Object detection, 3D detection, and pose estimation using center point detection

Take a look at the CenterNet architecture.
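CenterNet fits point labels naturally because it predicts a heatmap of object centers and keeps the local maxima as detections. The sketch below shows only that peak-picking step on a made-up heatmap; `find_peaks`, the threshold, and the values are hypothetical and are not the API of the CenterNet repository.

```python
def find_peaks(heatmap, threshold=0.5):
    """Return (x, y, score) for cells that are the maximum of their 3x3 neighbourhood."""
    height = len(heatmap)
    width = len(heatmap[0]) if height else 0
    peaks = []
    for y in range(height):
        for x in range(width):
            score = heatmap[y][x]
            if score < threshold:
                continue
            # Collect the 3x3 neighbourhood, clipped at the heatmap border.
            neighbourhood = [
                heatmap[ny][nx]
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
            ]
            if score >= max(neighbourhood):
                peaks.append((x, y, score))
    return peaks


# Made-up center heatmap with two pills.
heatmap = [
    [0.1, 0.2, 0.1, 0.0, 0.0],
    [0.2, 0.9, 0.3, 0.0, 0.1],
    [0.1, 0.3, 0.2, 0.4, 0.6],
    [0.0, 0.0, 0.1, 0.6, 0.8],
]
peaks = find_peaks(heatmap)
print(len(peaks), peaks)
```

The pill count is then just the number of peaks. Note that two adjacent cells with exactly equal scores would both be reported in this simplified version.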

DeepLabCut

12 4,283 8.7 Python

Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans

Well, you could try DeepLabCut: https://github.com/DeepLabCut/DeepLabCut

DeepPoseKit

5 379 0.0 Python

a toolkit for pose estimation using deep learning

Or a variation called DeepPoseKit (https://github.com/jgraving/DeepPoseKit), which hasn't been updated as recently but is easier to batch and script.

pose-tensorflow

1 1,133 10.0 C++

Human Pose estimation with TensorFlow framework

Also, DeepLabCut works primarily with videos. It builds on the DeeperCut pose estimation method from this repo: https://github.com/eldar/pose-tensorflow

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.