Python object-detection

Open-source Python projects categorized as object-detection

Top 23 Python object-detection Projects

  • yolov5

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

    Project mention: How would i go about having YOLO v5 return me a list from left to right of all detected objects in an image? | /r/computervision | 2023-11-13
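
    The question in the mention above comes down to sorting detections by horizontal position. A minimal sketch, assuming each detection has already been reduced to a label plus pixel box corners; the `Detection` type and the helper names are hypothetical and are not part of the YOLOv5 API:

```python
from typing import List, NamedTuple


class Detection(NamedTuple):
    """Hypothetical container for one detected object (pixel coordinates)."""
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def sort_left_to_right(detections: List[Detection]) -> List[Detection]:
    # Order boxes by their left edge; ties keep their original order
    # because Python's sort is stable.
    return sorted(detections, key=lambda d: d.x_min)


def labels_left_to_right(detections: List[Detection]) -> List[str]:
    # Convenience wrapper that returns only the class labels.
    return [d.label for d in sort_left_to_right(detections)]
```

    Sorting on the box centre, `(x_min + x_max) / 2`, is an equally reasonable key when objects differ a lot in width.
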
  • mmdetection

    OpenMMLab Detection Toolbox and Benchmark

    Project mention: Semantic segmentation | /r/computervision | 2023-04-12

    When I look for benchmarks I always start here; it has lists of datasets to measure models across lots of papers. Many are very specific models with low support or community, but it gives you a good idea of the state of the art. It also lists repositories related to good community. seems very active and the one that is being used the most; you could use the models that it has integrated in its model zoo, within the same repository. It has the benchmarks to compare those same models, and some of them are from 2022.


  • Mask_RCNN

    Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

    Project mention: Intuitively Understanding Harris Corner Detector | | 2023-09-11

    The most widely used algorithms for classical feature detection today are "whatever opencv implements"

    In terms of tech that's advancing at the moment? if you want to track individual points, and its descendants if you want to detect, say, the cover of a book.

  • ultralytics

    NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

    Project mention: The CEO of Ultralytics (yolov8) using LLMs to engage with commenters on GitHub | | 2024-02-12

    Yep, I noticed this a while ago. It posts easily identifiable ChatGPT responses. It also posts garbage wrong answers which makes it worse than useless. Totally disrespectful to the userbase.

  • frigate

    NVR with realtime local object detection for IP cameras

    Project mention: Picking between two cameras hikvision vs dahua. Both 4MP 1/1.8" turrets. | /r/homesecurity | 2023-12-09

    I'm into self-hosting and home servers, and finally got to try Frigate with an AliExpress camera that was not mine. Love it.

  • albumentations

    Fast image augmentation library and an easy-to-use wrapper around other libraries.

    Project mention: Augment specific classes? | /r/computervision | 2023-12-06

    You can use albumentations if you are comfortable with using open source libraries

  • Swin-Transformer

    This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

    Project mention: Samsung expected to report 80% profit plunge as losses mount at chip business | | 2023-10-10

    > there is really nothing that "normal" AI requires that is bound to CUDA. pyTorch and Tensorflow are backend agnostic (ideally...).

    There are a lot of optimizations that CUDA has that are nowhere near supported in other software or even hardware. Custom CUDA kernels also aren't as rare as one might think; they will often just be hidden unless you're looking at libraries. The best-known example is going to be StyleGAN[0], but it isn't uncommon to see them elsewhere, even in research code. Swin even has a CUDA kernel[1]. Or find torch here[1] (GitHub reports that 4% of the code is CUDA, 42% C++ and 2% C). These things are everywhere. I don't think PyTorch and TensorFlow could ever be agnostic; there will always be a difference just because you have to spend resources differently (developing kernels costs time). We can draw evidence by looking at Intel MKL, which is still better than open source libraries and has been so for a long time.

    I really do want AMD to compete in this space. I'd even love a third player like Intel. We really do need competition here, but it would be naive to think that there's going to be a quick catchup here. AMD has a lot of work to do and posting a few bounties and starting a company (idk, called "micro grad"?) isn't going to solve the problem anytime soon.

    And fwiw, I'm willing to bet that most AI companies would rather run in house servers than from cloud service providers. The truth is that right now just publishing is extremely correlated to compute infrastructure (doesn't need to be but with all the noise we've just said "fuck the poor" because rejecting is easy) and anyone building products has costly infrastructure.




  • PaddleDetection

    Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

    Project mention: [R]DETRs Beat YOLOs on Real-time Object Detection | /r/MachineLearning | 2023-04-19

    Our RT-DETR-L achieves 53.0% AP on COCO val2017 and 114 FPS on T4 GPU, while RT-DETR-X achieves 54.8% AP and 74 FPS, outperforming all YOLO detectors of the same scale in both speed and accuracy. Furthermore, our RT-DETR-R50 achieves 53.1% AP and 108 FPS, outperforming DINO-Deformable-DETR-R50 by 2.2% AP in accuracy and by about 21 times in FPS. Source code and pretrained models will be available at PaddleDetection.

  • yolov3

    YOLOv3 in PyTorch > ONNX > CoreML > TFLite

  • supervision

    We write your reusable computer vision tools. 💜

    Project mention: Ask HN: Who is hiring? (February 2024) | | 2024-02-01

    Roboflow | Open Source Software Engineer, Web Designer / Developer, and more. | Full-time (Remote, SF, NYC) |

    Roboflow is the fastest way to use computer vision in production. We help developers give their software the sense of sight. Our end-to-end platform[1] provides tooling for image collection, annotation, dataset exploration and curation, training, and deployment.

    Over 250k engineers (including engineers from 2/3 Fortune 100 companies) build with Roboflow. We now host the largest collection of open source computer vision datasets and pre-trained models[2]. We are pushing forward the CV ecosystem with open source projects like Autodistill[3] and Supervision[4]. And we've built one of the most comprehensive resources for software engineers to learn to use computer vision with our popular blog[5] and YouTube channel[6].

    We have several openings available but are primarily looking for strong technical generalists who want to help us democratize computer vision and like to wear many hats and have an outsized impact. Our engineering culture is built on a foundation of autonomy & we don't consider an engineer fully ramped until they can "choose their own loss function". At Roboflow, engineers aren't just responsible for building things but also for helping us figure out what we should build next. We're builders & problem solvers; not just coders. (For this reason we also especially love hiring past and future founders.)

    We're currently hiring full-stack engineers for our ML and web platform teams, a web developer to bridge our product and marketing teams, several technical roles on the sales & field engineering teams, and our first applied machine learning researcher to help push forward the state of the art in computer vision.







  • pytorch-grad-cam

    Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

    Project mention: Exploring GradCam and More with FiftyOne | | 2024-02-13

    For the two examples we will be looking at, we will be using pytorch_grad_cam, an incredible open source package that makes working with GradCam very easy. There are excellent other tutorials to check out on the repo as well.


  • YOLOX

    YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported.

    Project mention: Learning Exchange, lets training YoloX | /r/deeplearning | 2023-03-01

    So I am trying to do my best and train YOLOX for an object detection case using Google Colab.

  • ImageAI

    A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities

    Project mention: Photo gallery frontend with encryption and search | /r/selfhosted | 2023-11-27

    Hi. I want to implement an image server similar to Photoprism using ImageAI to tag images based on objects and context. However, I don't want to spend too much time working on the frontend. At first I was thinking about using Danbooru with Flexbooru or the web interface on my phone, but it doesn't have any encryption or password protection (since its purpose is to be used as a public image board).

  • TensorLayer

    Deep Learning and Reinforcement Learning Library for Scientists and Engineers

  • autogluon

    AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data

    Project mention: pip install remyxai - easiest way to create custom vision models | /r/computervision | 2023-04-25

    This seems not very convincing. There are other popular frameworks that provide AutoML with existing datasets (eg

  • darkflow

    Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices

    Project mention: FOSS self-hosted image-to-text gpu accelerated object recognition ? Is there anything on the table yet ? | /r/selfhosted | 2023-03-30

  • gluon-cv

    Gluon CV Toolkit

    Project mention: FOSS self-hosted image-to-text gpu accelerated object recognition ? Is there anything on the table yet ? | /r/selfhosted | 2023-03-30

  • mmdetection3d

    OpenMMLab's next-generation platform for general 3D object detection.

    Project mention: What's the best model to get monocular 3d angle info | /r/deeplearning | 2023-06-28

    There are a bunch of methods in this codebase, check it out.

  • GroundingDINO

    Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

    Project mention: Autodistill: A new way to create CV models | /r/developersIndia | 2023-09-30

    Some of the foundation/base models include: * GroundedSAM (Segment Anything Model) * DETIC * GroundingDINO

  • layout-parser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

  • sahi

    Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

    Project mention: Small-Object Detection using YOLOv8 | /r/computervision | 2023-08-15

    Hi All, I am trying to detect defects in images using YOLOv8, where some of the classes (defectType1, defectType2) have very small bounding boxes and some of them have large bounding boxes associated with them (defectType3, defectType4). Also, real-time operation is desired (at least 5 Hz on Jetson Xavier). What I have done till now: I am primarily trying to use the SAHI technique (Slicing Aided Hyper Inference)
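
    The slicing idea mentioned in the post above can be sketched without any detection framework: cut the image into overlapping tiles, run the detector on each tile, then shift tile-local boxes back into full-image coordinates. The function names and the default overlap below are assumptions for illustration and are not the sahi API:

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max) in pixels


def _starts(length: int, size: int, step: int) -> List[int]:
    # Start offsets along one axis; the last tile is pinned to the far
    # edge so the whole axis is covered.
    if size >= length:
        return [0]
    starts = list(range(0, length - size, step))
    starts.append(length - size)
    return starts


def slice_windows(image_w: int, image_h: int,
                  slice_w: int, slice_h: int,
                  overlap: float = 0.2) -> List[Box]:
    """Return overlapping tile windows that cover the full image."""
    step_x = max(1, int(slice_w * (1 - overlap)))
    step_y = max(1, int(slice_h * (1 - overlap)))
    return [
        (x, y, min(x + slice_w, image_w), min(y + slice_h, image_h))
        for y in _starts(image_h, slice_h, step_y)
        for x in _starts(image_w, slice_w, step_x)
    ]


def to_full_image(box: Box, window: Box) -> Box:
    # Shift a box predicted in tile-local coordinates by the tile's origin.
    ox, oy = window[0], window[1]
    return (box[0] + ox, box[1] + oy, box[2] + ox, box[3] + oy)
```

    Boxes from neighbouring tiles overlap in the shared margin, so a real pipeline would still need a merge step such as non-maximum suppression after mapping boxes back; that step is omitted here.
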

  • AdelaiDet

    AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

    Project mention: FOSS self-hosted image-to-text gpu accelerated object recognition ? Is there anything on the table yet ? | /r/selfhosted | 2023-03-30

  • catalyst

    Accelerated deep learning R&D (by catalyst-team)

    Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-13.


What are some of the best open-source object-detection projects in Python? This list will help you:

Rank  Project            Stars
   1  yolov5            45,215
   2  mmdetection       27,038
   3  Mask_RCNN         23,932
   4  ultralytics       19,570
   5  frigate           13,705
   6  albumentations    13,141
   7  Swin-Transformer  12,455
   8  PaddleDetection   11,799
   9  yolov3             9,891
  10  supervision        9,644
  11  pytorch-grad-cam   8,991
  12  YOLOX              8,864
  13  ImageAI            8,302
  14  TensorLayer        7,275
  15  autogluon          6,832
  16  darkflow           6,132
  17  gluon-cv           5,698
  18  mmdetection3d      4,557
  19  GroundingDINO      4,340
  20  layout-parser      4,293
  21  sahi               3,398
  22  AdelaiDet          3,296
  23  catalyst           3,201