Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR. Learn more โ
Top 23 Python object-detection Projects
-
There are several implementations of the YOLO algorithm available, but for ease-of-use, we will use the Ultralytics implementation in this guide. We will implement and test the code locally and then deploy to Koyeb's GPUs for higher inference speed.
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
-
Project mention: Show HN: Using YOLO to Detect Office Chairs in 40M Hotel Photos | news.ycombinator.com | 2025-01-25
They did it on their own computer. https://github.com/ultralytics/ultralytics
-
-
Project mention: Octopuses seen hunting together with fish, punching those that don't cooperate | news.ycombinator.com | 2024-09-26
The study: https://www.nature.com/articles/s41559-024-02525-2
Sampaio et al., Multidimensional social influence drives leadership and composition-dependent success in octopusโfish hunting groups
I was curious about the the 3D reconstruction of the scenes.
> We manually tracked individuals in the videos using the software Computer Vision Annotation Tool. We annotated three frames per second, which yielded a time resolution of 0.33 s for animal movement.
> We then used another software developed to incorporate the previously tracked animals in each camera in the โcolmapโ habitat models and camera paths, โmultiviewtracksโ or โmvtโ [29]
> [29] Francisco, F. A., Nรผhrenberg, P. & Jordan, A. High-resolution, non-invasive animal tracking and reconstruction of local environment in aquatic ecosystems
Seems it was specifically developed to track fish. Cool project.
Links to https://github.com/matterport/Mask_RCNN and https://github.com/pnuehrenberg/multiviewtracks.
-
-
albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Project mention: Albumentations: Fastest and most flexible image augmentation library | news.ycombinator.com | 2024-11-30 -
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
-
Nutrient
Nutrient โ The #1 PDF SDK Library, trusted by 10K+ developers. Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrientโs SDK handles billion-page workloads - so you donโt have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.
-
PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
-
pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
-
-
YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
-
But the challenge lies in organizing and managing this unstructured data. Thatโs where tools like FiftyOne, Voxel 51โs open-source dataset management tool, come in.
-
ImageAI
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
-
๐ Composed Image Retrieval ๐ Intro to Multimodal LLama 3.2 ๐ ๏ธ Multi Agent Concierge ๐ป RAG with Langchain Granite, Milvus ๐ซถ Download content โ Transformer Replacement? ๐ค vLLM for runing models ๐ Amphion ๐ Autogluon ๐ Notebook LLama like Google's Notebook LLM ๐ซถ Monocle2ai for tracing GenAI app code LFA&D Project ๐ค Bee Agent Framework โ LLama RFP Response โถ๏ธ GenAI Script ๐ฝ Simular AI Agent S ๐ฆพ DrawDB with AI โจ Ollama with LLama 3.2 Vision!!!! Preview ๐ Powerful RAG Checker ๐ SQL Generator ๐ป Role of LLMs ๐ Document Extraction ๐ถ๏ธ Open Source Vector DB Reddit ๐ The Practical Guide to Self Hosting LLM ๐ฆพ Stagehand Controller ๐ถ๏ธ Understanding HNSWLIB ๐ Best practices in RAG ๐ป Enigma Agent ๐ Langchain, Ollama, Phi3 for Function Calling ๐ Compass Judger ๐ Princeton NLP SimPO ๐ Princeton NLP ProLong ๐ Princeton NLP HELMET ๐ง Ollama Cheatsheet ๐ Princeton NLP CopyCat ๐ Princeton NLP Shp ๐ถ๏ธ Can LLM Solve Hard Github Issues ๐ Enabling Large Language Models to Generate Text with Citations ๐ Princeton NLP CharXiv ๐ Awesome AI Agents List ๐ฆพ Nomicโs Matryoshka text embedding model
-
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
-
-
darkflow
Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices
-
-
-
Project mention: Ask HN: What are you using to parse PDFs for RAG? | news.ycombinator.com | 2024-07-30
-
An alternative to this is to leverage existing object detection, apply the model to patches or slices of fixed size in our image, and then stitch the results together. This is the idea behind Slicing-Aided Hyper Inference!
-
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python object-detection discussion
Python object-detection related posts
-
GeoDeep: AI object detection and semantic segmentation in geospatial rasters
-
Show HN: Using YOLO to Detect Office Chairs in 40M Hotel Photos
-
Ask HN: Who is hiring? (December 2024)
-
Octopuses seen hunting together with fish, punching those that don't cooperate
-
Show HN: I am using AI to drop hats outside my window onto New Yorkers
-
YOLOv5 on FPGA with Hailo-8 and 4 Pi Cameras
-
Mastering YOLOv10: A Complete Guide with Hands-On Projects
-
A note from our sponsor - CodeRabbit
coderabbit.ai | 14 Feb 2025
Index
What are some of the best open-source object-detection projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | yolov5 | 52,325 |
2 | ultralytics | 36,380 |
3 | mmdetection | 30,192 |
4 | Mask_RCNN | 24,907 |
5 | supervision | 24,868 |
6 | albumentations | 14,562 |
7 | Swin-Transformer | 14,267 |
8 | PaddleDetection | 13,048 |
9 | pytorch-grad-cam | 11,068 |
10 | yolov3 | 10,299 |
11 | YOLOX | 9,631 |
12 | fiftyone | 9,146 |
13 | ImageAI | 8,722 |
14 | autogluon | 8,345 |
15 | GroundingDINO | 7,356 |
16 | TensorLayer | 7,338 |
17 | darkflow | 6,137 |
18 | gluon-cv | 5,864 |
19 | mmdetection3d | 5,493 |
20 | layout-parser | 5,062 |
21 | sahi | 4,284 |
22 | AdelaiDet | 3,416 |
23 | catalyst | 3,322 |