Python object-detection

Open-source Python projects categorized as object-detection

Top 23 Python object-detection Projects

object-detection
  1. ultralytics

    Ultralytics YOLO 🚀

    Project mention: Transforming Unstructured Retail Catalogs into Structured Data using AI | dev.to | 2026-04-10

    Before reading any text, we run the raw catalog pages through a custom object detection model (based on YOLO architecture). This model is trained to identify the bounding boxes of individual product regions, allowing us to crop the giant page into smaller, isolated product images.

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. yolov5

    Ultralytics YOLOv5 in PyTorch > ONNX > CoreML > TFLite

    Project mention: Teaching AI to Read Emotions: Science, Challenges, and Innovation Behind Facial Emotion Detection with YOLOv11 on Raspberry Pi | dev.to | 2025-11-23

    Ultralytics YOLO Documentation

  4. supervision

    We write your reusable computer vision tools. 💜

    Project mention: Benchmarking ASR & Essential Open-Source CV Tools for Local AI | dev.to | 2026-06-09

    Source: https://github.com/roboflow/supervision

  5. mmdetection

    OpenMMLab Detection Toolbox and Benchmark

    Project mention: PYX: The next step in Python packaging | news.ycombinator.com | 2025-08-13

    There certainly are issues on Linux as well. The Detectron2 library alone has several hundred issues related to incorrect versions of something: https://github.com/facebookresearch/detectron2/issues

    The mmdetection library (https://github.com/open-mmlab/mmdetection/issues) also has hundreds of version-related issues. Admittedly, that library has not seen any updates for over a year now, but it is sad that things just break and become basically unusable on modern Linux operating systems because NVIDIA can't stop breaking backwards and forwards compatibility for what is essentially just fancy matrix multiplication.

  6. Mask_RCNN

    Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

  7. Swin-Transformer

    This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

  8. PaddleDetection

    Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

  9. pytorch-grad-cam

    Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

  10. fiftyone

    Refine high-quality datasets and visual AI models

  11. yolov3

    Ultralytics YOLOv3 in PyTorch > ONNX > CoreML > TFLite

  12. YOLOX

    YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

    Project mention: Extracting Pedestrian Trajectories from Street Video as JSON | dev.to | 2026-03-14

    # Required packages pip install opencv-python numpy onnxruntime # Download YOLOX-Tiny ONNX model # From: https://github.com/Megvii-BaseDetection/YOLOX

  13. autogluon

    Fast and Accurate ML in 3 Lines of Code

    Project mention: Gluon: a GPU programming language based on the same compiler stack as Triton | news.ycombinator.com | 2025-09-17

    Amazon (+ Microsoft) already released a language for ML called gluon 8 years ago: https://aws.amazon.com/blogs/aws/introducing-gluon-a-new-lib...

    autogluon is popular as well: https://github.com/autogluon/autogluon

  14. GroundingDINO

    [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

  15. X-AnyLabeling

    Effortless data labeling with AI support from Segment Anything and other awesome models.

    Project mention: Just added SAM3 video object tracking to X-AnyLabeling! | dev.to | 2026-01-03

    Setup guide: https://github.com/CVHub520/X-AnyLabeling/blob/main/examples/interactive_video_object_segmentation/sam3/README.md

  16. ImageAI

    A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities

  17. rf-detr

    RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning. [ICLR 2026]

    Project mention: RF-DETR vs YOLO vs Cloud API: Which Should You Actually Use in 2026? | dev.to | 2026-03-20

    RF-DETR GitHub (Apache 2.0)

  18. TensorLayer

    Deep Learning and Reinforcement Learning Library for Scientists and Engineers

  19. mmdetection3d

    OpenMMLab's next-generation platform for general 3D object detection.

  20. darkflow

    Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices

  21. gluon-cv

    Gluon CV Toolkit

    Project mention: Gluon: a GPU programming language based on the same compiler stack as Triton | news.ycombinator.com | 2025-09-17
  22. layout-parser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

  23. sahi

    Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

    Project mention: From 9 Tiles to 900: Scaling Computer Vision Pipelines | dev.to | 2026-06-04

    Tiled inference isn't a niche technique. It's the industry standard for any image that exceeds a model's input constraints. SAHI (Slicing Aided Hyper Inference) has over 35,000 stars on GitHub. It partitions images into overlapping slices, runs detection on each slice, and stitches results together. Digital pathology pipelines routinely tile gigapixel whole-slide images into thousands of patches for parallel inference. Satellite imagery processing architectures on AWS all involve the same core pattern: tile, infer in parallel, aggregate.

  24. AdelaiDet

    AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python object-detection discussion

Log in or Post with

Python object-detection related posts

  • From 9 Tiles to 900: Scaling Computer Vision Pipelines

    2 projects | dev.to | 4 Jun 2026
  • RF-DETR vs YOLO vs Cloud API: Which Should You Actually Use in 2026?

    1 project | dev.to | 20 Mar 2026
  • Teaching AI to Read Emotions: Science, Challenges, and Innovation Behind Facial Emotion Detection with YOLOv11 on Raspberry Pi

    2 projects | dev.to | 23 Nov 2025
  • Segment Anything 3

    2 projects | news.ycombinator.com | 19 Nov 2025
  • Labellerr YOLOv8: Cars and Number Plate Detection — Practical, Step-by-Step

    1 project | dev.to | 5 Nov 2025
  • Gluon: a GPU programming language based on the same compiler stack as Triton

    7 projects | news.ycombinator.com | 17 Sep 2025
  • Introdução a Visão Computacional - Parte 2: Detecção de Objetos com YOLO 11

    2 projects | dev.to | 8 Jun 2025
  • A note from our sponsor - SaaSHub
    www.saashub.com | 13 Jun 2026
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source object-detection projects in Python? This list will help you:

# Project Stars
1 ultralytics 58,307
2 yolov5 57,519
3 supervision 43,984
4 mmdetection 32,695
5 Mask_RCNN 25,564
6 Swin-Transformer 15,857
7 PaddleDetection 14,241
8 pytorch-grad-cam 12,860
9 fiftyone 10,764
10 yolov3 10,571
11 YOLOX 10,483
12 autogluon 10,466
13 GroundingDINO 10,063
14 X-AnyLabeling 9,398
15 ImageAI 8,867
16 rf-detr 7,688
17 TensorLayer 7,389
18 mmdetection3d 6,353
19 darkflow 6,147
20 gluon-cv 5,922
21 layout-parser 5,730
22 sahi 5,344
23 AdelaiDet 3,480

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you know that Python is
the 1st most popular programming language
based on number of references?