Top 23 Python object-detection Projects
-
Project mention: Transforming Unstructured Retail Catalogs into Structured Data using AI | dev.to | 2026-04-10
Before reading any text, we run the raw catalog pages through a custom object detection model (based on YOLO architecture). This model is trained to identify the bounding boxes of individual product regions, allowing us to crop the giant page into smaller, isolated product images.
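The cropping step described above can be sketched as follows. This is a minimal illustration, not the post's actual pipeline: the `(x1, y1, x2, y2)` pixel box format, the helper names `clamp_box` and `crop_regions`, and the toy page are all assumptions. The page is a plain nested list so the example runs without dependencies.

```python
from typing import List, Sequence, Tuple

# Assumed box format: (x1, y1, x2, y2) in pixel coordinates of the full page.
Box = Tuple[float, float, float, float]


def clamp_box(box: Box, width: int, height: int) -> Tuple[int, int, int, int]:
    """Round a box to whole pixels and clip it to the page bounds."""
    x1, y1, x2, y2 = box
    x1 = max(0, min(width, int(round(x1))))
    y1 = max(0, min(height, int(round(y1))))
    x2 = max(0, min(width, int(round(x2))))
    y2 = max(0, min(height, int(round(y2))))
    return x1, y1, x2, y2


def crop_regions(
    page: Sequence[Sequence[int]], boxes: List[Box], min_size: int = 1
) -> List[List[List[int]]]:
    """Return one cropped sub-image per usable detection box."""
    height = len(page)
    width = len(page[0]) if height else 0
    crops = []
    for box in boxes:
        x1, y1, x2, y2 = clamp_box(box, width, height)
        if x2 - x1 < min_size or y2 - y1 < min_size:
            continue  # skip boxes that collapse to nothing after clipping
        crops.append([list(row[x1:x2]) for row in page[y1:y2]])
    return crops


# Toy 4x6 "page" whose pixel value encodes its position (10 * row + column).
page = [[10 * r + c for c in range(6)] for r in range(4)]
# Hypothetical detector output: two usable boxes and one fully off-page box.
boxes = [(1.0, 0.0, 3.0, 2.0), (4.2, 1.0, 9.0, 4.0), (5.0, 5.0, 5.0, 5.0)]
crops = crop_regions(page, boxes)
print(len(crops))  # number of product crops kept
```

With a NumPy image array the inner slicing would typically be `page[y1:y2, x1:x2]`; the clamping logic stays the same.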
-
-
Project mention: Teaching AI to Read Emotions: Science, Challenges, and Innovation Behind Facial Emotion Detection with YOLOv11 on Raspberry Pi | dev.to | 2025-11-23
Ultralytics YOLO Documentation
-
Project mention: Benchmarking ASR & Essential Open-Source CV Tools for Local AI | dev.to | 2026-06-09
Source: https://github.com/roboflow/supervision
-
There certainly are issues on Linux as well. The Detectron2 library alone has several hundred issues related to incorrect versions of something: https://github.com/facebookresearch/detectron2/issues
The mmdetection library (https://github.com/open-mmlab/mmdetection/issues) also has hundreds of version-related issues. Admittedly, that library has not seen any updates for over a year now, but it is sad that things just break and become basically unusable on modern Linux operating systems because NVIDIA can't stop breaking backwards and forwards compatibility for what is essentially just fancy matrix multiplication.
-
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
-
PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
-
pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
-
YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
# Required packages
pip install opencv-python numpy onnxruntime
# Download YOLOX-Tiny ONNX model
# From: https://github.com/Megvii-BaseDetection/YOLOX
-
Project mention: Gluon: a GPU programming language based on the same compiler stack as Triton | news.ycombinator.com | 2025-09-17
Amazon (+ Microsoft) already released a language for ML called gluon 8 years ago: https://aws.amazon.com/blogs/aws/introducing-gluon-a-new-lib...
autogluon is popular as well: https://github.com/autogluon/autogluon
-
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
-
X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
Setup guide: https://github.com/CVHub520/X-AnyLabeling/blob/main/examples/interactive_video_object_segmentation/sam3/README.md
-
ImageAI
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
-
rf-detr
RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning. [ICLR 2026]
Project mention: RF-DETR vs YOLO vs Cloud API: Which Should You Actually Use in 2026? | dev.to | 2026-03-20
RF-DETR GitHub (Apache 2.0)
-
darkflow
Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices
-
Tiled inference isn't a niche technique. It's the industry standard for any image that exceeds a model's input constraints. SAHI (Slicing Aided Hyper Inference) has over 5,000 stars on GitHub. It partitions images into overlapping slices, runs detection on each slice, and stitches the results together. Digital pathology pipelines routinely tile gigapixel whole-slide images into thousands of patches for parallel inference. Satellite imagery processing architectures on AWS follow the same core pattern: tile, infer in parallel, aggregate.
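The tile, infer, aggregate pattern can be sketched in plain Python. This is an illustrative outline under stated assumptions, not SAHI's API: `tile_origins`, `tiled_detect`, and the `detect` callback are hypothetical names, boxes are assumed to be `(x1, y1, x2, y2, score)` tuples, and duplicates from overlapping tiles are merged with a simple greedy non-maximum suppression.

```python
from typing import Callable, List, Tuple

# Assumed detection format: (x1, y1, x2, y2, score).
Det = Tuple[float, float, float, float, float]


def tile_origins(size: int, tile: int, overlap: int) -> List[int]:
    """Tile start offsets along one axis; the last tile sits flush with the edge."""
    if tile >= size:
        return [0]
    stride = tile - overlap
    if stride <= 0:
        raise ValueError("overlap must be smaller than tile")
    origins = list(range(0, size - tile, stride))
    origins.append(size - tile)
    return origins


def iou(a: Det, b: Det) -> float:
    """Intersection over union of two boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms(dets: List[Det], iou_threshold: float = 0.5) -> List[Det]:
    """Greedy NMS: keep the highest-scoring box, drop boxes that overlap it."""
    kept: List[Det] = []
    for det in sorted(dets, key=lambda d: d[4], reverse=True):
        if all(iou(det, k) < iou_threshold for k in kept):
            kept.append(det)
    return kept


def tiled_detect(
    width: int,
    height: int,
    detect: Callable[[int, int, int, int], List[Det]],
    tile: int = 400,
    overlap: int = 100,
) -> List[Det]:
    """Run `detect` per tile, shift boxes to full-image coordinates, then merge."""
    merged: List[Det] = []
    for oy in tile_origins(height, tile, overlap):
        for ox in tile_origins(width, tile, overlap):
            for x1, y1, x2, y2, score in detect(ox, oy, tile, tile):
                merged.append((x1 + ox, y1 + oy, x2 + ox, y2 + oy, score))
    return nms(merged)


def fake_detect(ox: int, oy: int, w: int, h: int) -> List[Det]:
    """Stand-in for a real model: reports one object if it lies fully in the tile."""
    gx1, gy1, gx2, gy2 = 350, 350, 390, 390  # object sits in a tile overlap zone
    if ox <= gx1 and gx2 <= ox + w and oy <= gy1 and gy2 <= oy + h:
        return [(gx1 - ox, gy1 - oy, gx2 - ox, gy2 - oy, 0.9)]
    return []


result = tiled_detect(1000, 1000, fake_detect)
print(result)
```

In this toy run the object falls inside four overlapping tiles, so it is detected four times and the merge step reduces it to a single box. The loop over tiles is independent per tile, which is what makes the parallel variant of the pattern straightforward.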
-
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Python object-detection related posts
-
From 9 Tiles to 900: Scaling Computer Vision Pipelines
-
RF-DETR vs YOLO vs Cloud API: Which Should You Actually Use in 2026?
-
Teaching AI to Read Emotions: Science, Challenges, and Innovation Behind Facial Emotion Detection with YOLOv11 on Raspberry Pi
-
Segment Anything 3
-
Labellerr YOLOv8: Cars and Number Plate Detection — Practical, Step-by-Step
-
Gluon: a GPU programming language based on the same compiler stack as Triton
-
Introduction to Computer Vision - Part 2: Object Detection with YOLO 11
-
Index
What are some of the best open-source object-detection projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | ultralytics | 58,307 |
| 2 | yolov5 | 57,519 |
| 3 | supervision | 43,984 |
| 4 | mmdetection | 32,695 |
| 5 | Mask_RCNN | 25,564 |
| 6 | Swin-Transformer | 15,857 |
| 7 | PaddleDetection | 14,241 |
| 8 | pytorch-grad-cam | 12,860 |
| 9 | fiftyone | 10,764 |
| 10 | yolov3 | 10,571 |
| 11 | YOLOX | 10,483 |
| 12 | autogluon | 10,466 |
| 13 | GroundingDINO | 10,063 |
| 14 | X-AnyLabeling | 9,398 |
| 15 | ImageAI | 8,867 |
| 16 | rf-detr | 7,688 |
| 17 | TensorLayer | 7,389 |
| 18 | mmdetection3d | 6,353 |
| 19 | darkflow | 6,147 |
| 20 | gluon-cv | 5,922 |
| 21 | layout-parser | 5,730 |
| 22 | sahi | 5,344 |
| 23 | AdelaiDet | 3,480 |