Top 23 Python Computer Vision Projects
-
Project mention: Transforming Unstructured Retail Catalogs into Structured Data using AI | dev.to | 2026-04-10
Before reading any text, we run the raw catalog pages through a custom object detection model (based on the YOLO architecture). This model is trained to identify the bounding boxes of individual product regions, allowing us to crop the giant page into smaller, isolated product images.
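The cropping step described above can be sketched roughly as follows. This is only an illustration, not the post's actual pipeline: it assumes the detector returns pixel boxes in (x1, y1, x2, y2) form with a confidence score, and it models the page as a row-major nested list so the example runs without any imaging library. `Box`, `crop_regions`, and `min_score` are hypothetical names introduced here.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Hypothetical detection result: pixel corners plus a confidence score."""
    x1: float
    y1: float
    x2: float
    y2: float
    score: float


def crop_regions(page, boxes, min_score=0.5):
    """Return one cropped sub-image per sufficiently confident box.

    `page` is a row-major 2D list of pixel values (an assumption made to
    keep the sketch dependency-free; a real pipeline would use image arrays).
    """
    height = len(page)
    width = len(page[0]) if height else 0
    crops = []
    for box in boxes:
        if box.score < min_score:
            continue  # drop low-confidence detections
        # Clamp the box to the page so slicing never leaves the image.
        left, top = max(0, int(box.x1)), max(0, int(box.y1))
        right, bottom = min(width, int(box.x2)), min(height, int(box.y2))
        if right <= left or bottom <= top:
            continue  # degenerate or fully off-page box
        crops.append([row[left:right] for row in page[top:bottom]])
    return crops


# Toy 4x6 "page" whose pixel value encodes its row and column.
page = [[r * 10 + c for c in range(6)] for r in range(4)]
detections = [Box(1, 1, 4, 3, 0.9), Box(0, 0, 2, 2, 0.2), Box(4, 2, 10, 10, 0.8)]
crops = crop_regions(page, detections)
```

Clamping before slicing matters because detectors can predict boxes that extend slightly past the page edge; each crop can then be passed on to the text-reading stage on its own.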
-
Project mention: Show HN: Real-time privacy protection for smart glasses | news.ycombinator.com | 2025-08-11
Did you look at egoblur? It's a lot more effective at face detection than https://github.com/ageitgey/face_recognition. Granted, you'd have to do your own face matching to handle exceptions.
-
Project mention: Benchmarking ASR & Essential Open-Source CV Tools for Local AI | dev.to | 2026-06-09
Source: https://github.com/roboflow/supervision
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
Project mention: EasyOCR2 with rust speed. 5X-10x faster than EasyOCR | news.ycombinator.com | 2025-12-28
-
d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
-
gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Project mention: Gaussian Splatting – A$AP Rocky Helicopter Music Video | news.ycombinator.com | 2026-01-18
This 2-minute video is a great intro to the topic https://www.youtube.com/watch?v=HVv_IQKlafQ
I think this tech has become "production-ready" recently due to a combination of research progress (the seminal paper was published in 2023 https://repo-sam.inria.fr/fungraph/3d-gaussian-splatting/) and improvements to GPU hardware/libraries.
-
datasets
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
Project mention: GSoC 2026 Predictions: 30 NEW AI/ML/Security Organizations You Should Start Contributing to NOW! | dev.to | 2026-02-06
-
labelme
Image annotation with Python. Supports polygon, rectangle, circle, line, point, and AI-assisted annotation.
-
OpenCLIP: Open Source Implementation
-
Project mention: One Open Source Project per Day #74: ai-engineering-from-scratch - Build AI Full-stack Skills from Ground Up | dev.to | 2026-05-23
git clone https://github.com/rohitg00/ai-engineering-from-scratch.git
cd ai-engineering-from-scratch
python phases/01-math-foundations/01-linear-algebra-intuition/code/vectors.py
-
pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
-
Project mention: OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision | news.ycombinator.com | 2026-06-10
-
segmentation_models.pytorch
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
-
In vmstat, watch r, wa, bi, and the sum of us and sy. r is the number of runnable processes; if it stays above your CPU core count, the CPU is saturated. wa is the percentage of CPU time spent waiting on I/O; sustained values above 10 to 15 percent during training often mean dataloader workers are blocked on disk reads. bi is the number of blocks received from storage per second; high bi together with high wa points to a storage bottleneck rather than a compute one. us + sy is the total active CPU time (user plus system); if it is high while GPU-Util is low, CPU-side preprocessing cannot keep up and the GPU is waiting for data. If wa is high, increase the number of dataloader workers or switch to faster storage. If us + sy is high with low GPU-Util, move transforms to the GPU with a library such as Kornia.
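As a rough illustration of these rules of thumb, the sketch below parses the default `vmstat` table and applies the checks to the averaged samples. It is a minimal sketch under stated assumptions, not a tuned tool: `parse_vmstat`, `diagnose`, and the finding labels are hypothetical names, the `busy_threshold` and `gpu_low` cutoffs are illustrative values that the text above does not specify, and GPU utilization is assumed to be read separately (for example from nvidia-smi) and passed in as a number.

```python
from dataclasses import dataclass


@dataclass
class VmstatSample:
    r: int   # runnable processes
    wa: int  # % CPU time waiting on I/O
    bi: int  # blocks received from storage per second
    us: int  # % CPU time in user code
    sy: int  # % CPU time in kernel code


def parse_vmstat(text):
    """Parse `vmstat <interval>` output, locating columns by header name."""
    samples, header = [], None
    for line in text.strip().splitlines():
        fields = line.split()
        if not fields:
            continue
        if fields[0] == "r":  # the column-name row
            header = fields
            continue
        if header is None or not fields[0].isdigit() or len(fields) < len(header):
            continue  # skip the "procs ---memory---" banner and partial rows
        row = dict(zip(header, map(int, fields)))
        samples.append(VmstatSample(row["r"], row["wa"], row["bi"], row["us"], row["sy"]))
    return samples


def diagnose(samples, cpu_cores, gpu_util, wa_threshold=10,
             busy_threshold=80, gpu_low=50):
    """Apply the heuristics to sample averages.

    busy_threshold and gpu_low are assumed cutoffs chosen for illustration.
    """
    if not samples:
        raise ValueError("no vmstat samples to analyse")

    def avg(name):
        return sum(getattr(s, name) for s in samples) / len(samples)

    findings = []
    if avg("r") > cpu_cores:
        findings.append("cpu_saturated")
    if avg("wa") > wa_threshold:
        findings.append("io_wait")  # check bi too: high bi suggests storage
    if avg("us") + avg("sy") > busy_threshold and gpu_util < gpu_low:
        findings.append("cpu_preprocessing_bottleneck")
    return findings


SAMPLE = """\
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa st
 2  1      0 812340  10240 402112    0    0  9200    40 1200 2100 12  6 52 30  0
 3  2      0 810112  10240 404480    0    0  8800    36 1180 2050 10  5 57 28  0
"""

findings = diagnose(parse_vmstat(SAMPLE), cpu_cores=8, gpu_util=35.0)
print(findings)  # -> ['io_wait']
```

Looking columns up by header name rather than by position keeps the parser working on vmstat builds that add extra CPU columns. Averaging over several samples reflects the "sustained" wording above, since a single spike in wa is not by itself a sign of a bottleneck.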
-
Project mention: Gluon: a GPU programming language based on the same compiler stack as Triton | news.ycombinator.com | 2025-09-17
Amazon (+ Microsoft) already released a language for ML called gluon 8 years ago: https://aws.amazon.com/blogs/aws/introducing-gluon-a-new-lib...
autogluon is popular as well: https://github.com/autogluon/autogluon
-
U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Python Computer Vision related posts
-
From 9 Tiles to 900: Scaling Computer Vision Pipelines
-
Gaussian Point Splatting
-
Sustainable AI Starts with Efficient AI
-
remove-ai-watermarks: a CLI that removes SynthID, C2PA, and the Gemini sparkle
-
A Complete Guide to Real-Time GPU Usage Monitoring
-
RF-DETR vs YOLO vs Cloud API: Which Should You Actually Use in 2026?
-
It Took Me 30 Years to Solve This VFX Problem – Green Screen Problem [video]
-
A note from our sponsor - SaaSHub
www.saashub.com | 14 Jun 2026
Index
What are some of the best open-source Computer Vision projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | ultralytics | 58,307 |
| 2 | Face Recognition | 56,402 |
| 3 | supervision | 43,984 |
| 4 | EasyOCR | 29,592 |
| 5 | d2l-en | 28,853 |
| 6 | pytorch-CycleGAN-and-pix2pix | 25,151 |
| 7 | gaussian-splatting | 22,330 |
| 8 | datasets | 21,620 |
| 9 | vision | 17,733 |
| 10 | labelme | 15,954 |
| 11 | facenet | 14,317 |
| 12 | open_clip | 13,904 |
| 13 | ai-engineering-from-scratch | 13,774 |
| 14 | pytorch-grad-cam | 12,860 |
| 15 | Meshroom | 12,788 |
| 16 | fashion-mnist | 12,741 |
| 17 | ludwig | 11,716 |
| 18 | nerfstudio | 11,683 |
| 19 | segmentation_models.pytorch | 11,609 |
| 20 | Kornia | 11,240 |
| 21 | fiftyone | 10,776 |
| 22 | autogluon | 10,466 |
| 23 | U-2-Net | 9,680 |