Top 23 Python Image processing Projects
-
Project mention: Show HN: Plug-and-play Python utils for any computer-vision pipeline | news.ycombinator.com | 2025-07-21
-
Project mention: Turn Scanned PDFs into OCR'd Text via Email (Supports English and Dutch) | news.ycombinator.com | 2026-01-05
We wanted a lightweight solution for people, NGOs, and organizations that deal with scanned documents but don’t want to manage OCR software locally. This service uses https://ocrmypdf.readthedocs.io/ under the hood.
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
Project mention: EasyOCR2 with rust speed. 5X-10x faster than EasyOCR | news.ycombinator.com | 2025-12-28
-
Project mention: Sharp Bilinear Filters: Big Clean Pixels for Pixel Art | news.ycombinator.com | 2025-10-16
> that to the display resolution using bilinear
On that topic, Pillow's so-called bilinear isn't actually bilinear interpolation [1][2]; the same goes for Magick, IIRC (but Magick at least gives you -define filter:blur= to counteract this).
[1] https://pillow.readthedocs.io/en/stable/releasenotes/2.7.0.h...
[2] https://github.com/python-pillow/Pillow/blob/main/src/libIma...
-
segmentation_models.pytorch
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
-
In vmstat, watch r, wa, bi, and us plus sy together. r is the number of runnable processes; if it stays above your CPU core count, the CPU is saturated. wa is the percentage of CPU time spent waiting on I/O; sustained values above 10 to 15 percent during training often mean dataloader workers are blocked on disk reads. bi is blocks received from storage; high bi together with high wa points to a storage bottleneck rather than a compute one. us + sy is total active CPU time; if it is high while GPU-Util is low, CPU-side preprocessing cannot keep up and the GPU is waiting on data. If wa is high, increase dataloader workers or switch to faster storage. If us + sy is high with low GPU-Util, move transforms to the GPU with a library such as Kornia.
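The rules of thumb above can be sketched as a small checker over captured vmstat output. This is only an illustration: the function names, the finding labels, the "busy CPU" and "idle GPU" cut-offs, and the sample text are all assumptions made for the example, and gpu_util is assumed to be read separately (for instance from nvidia-smi).

```python
# Illustrative sketch only. Thresholds other than the 10 percent wa limit,
# and the SAMPLE text itself, are assumptions, not measured values.

def parse_vmstat(text):
    """Parse plain `vmstat` text into a list of {column_name: int} samples."""
    header = None
    samples = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        if "r" in tokens and "wa" in tokens:
            header = tokens  # the column-name row: r b swpd ... us sy id wa st
            continue
        is_numeric_row = all(t.isdigit() for t in tokens)
        if header and len(tokens) == len(header) and is_numeric_row:
            samples.append(dict(zip(header, map(int, tokens))))
    return samples


def diagnose(samples, cpu_cores, gpu_util=None,
             wa_limit=10, busy_cpu=80, idle_gpu=50):
    """Apply the heuristics from the text; returns a list of finding labels."""
    if not samples:
        return []
    n = len(samples)
    findings = []
    # r staying above the core count on every sample suggests a saturated CPU.
    if all(s["r"] > cpu_cores for s in samples):
        findings.append("cpu_saturated")
    # Sustained I/O wait: compare the average wa against the limit.
    if sum(s["wa"] for s in samples) / n > wa_limit:
        findings.append("io_wait_high")
    # High us + sy with a mostly idle GPU suggests CPU-side preprocessing
    # is the bottleneck (busy_cpu and idle_gpu are hypothetical cut-offs).
    active_cpu = sum(s["us"] + s["sy"] for s in samples) / n
    if gpu_util is not None and active_cpu >= busy_cpu and gpu_util < idle_gpu:
        findings.append("cpu_preprocessing_bottleneck")
    return findings


# Made-up capture in the usual `vmstat 1` layout.
SAMPLE = """\
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa st
 1  2      0 812340  10240 904512    0    0  9200    40  950 1800 12  6 60 22  0
 2  3      0 811900  10240 904800    0    0  8800    36  990 1750 10  5 61 24  0
"""

if __name__ == "__main__":
    print(diagnose(parse_vmstat(SAMPLE), cpu_cores=8, gpu_util=35))
```

A finding of io_wait_high would map to the "more dataloader workers or faster storage" advice, and cpu_preprocessing_bottleneck to moving transforms onto the GPU.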
-
manga-image-translator
Translate manga/images: one-click translation of the text inside all kinds of images. https://cotrans.touhou.ai/ (no longer working)
-
U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
-
scikit-image
-
darkflow
Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices
-
sketch-code
Keras model to generate HTML code from hand-drawn website mockups. Applies an image captioning architecture to hand-drawn source images.
-
PySceneDetect
Python and OpenCV-based scene cut/transition detection program & library.
Scene detection uses PySceneDetect with its ContentDetector, which analyzes frame-to-frame pixel differences to find hard cuts. DV camcorder footage has clean cuts between scenes (you literally pressed the record button), so this works well out of the box, but the defaults produced too many tiny micro-scenes for my taste, with every pan or exposure change spawning its own clip. After some experimentation I settled on --threshold 45 (a bit less sensitive than the default) and --min-scene-len 20 (meant to ignore any "scene" shorter than 20 seconds; PySceneDetect reads a bare integer as a frame count, so a duration in seconds is written with an s suffix, as in 20s), which matched how I actually wanted to browse the footage.
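To make the roles of the two settings concrete, here is a deliberately simplified stand-in for threshold-based cut detection. It is not PySceneDetect's implementation (the library's own scoring is more involved); the function names are hypothetical, frames are assumed to be flat lists of pixel intensities, and min_scene_len is counted in frames.

```python
# Simplified illustration of "frame difference + threshold + minimum scene
# length". Hypothetical helper names; not the PySceneDetect API.

def mean_abs_diff(frame_a, frame_b):
    """Average absolute per-pixel difference between two equal-sized frames."""
    return sum(abs(a - b) for a, b in zip(frame_a, frame_b)) / len(frame_a)


def find_cuts(frames, threshold=45.0, min_scene_len=20):
    """Return frame indices where a new scene starts.

    A cut is reported when the difference to the previous frame reaches
    `threshold` and at least `min_scene_len` frames have passed since the
    previous cut (or since the start of the video).
    """
    cuts = []
    last_cut = 0
    for i in range(1, len(frames)):
        score = mean_abs_diff(frames[i - 1], frames[i])
        if score >= threshold and i - last_cut >= min_scene_len:
            cuts.append(i)
            last_cut = i
    return cuts


def scenes_from_cuts(cuts, n_frames):
    """Turn cut indices into (start, end) frame ranges, end exclusive."""
    bounds = [0] + list(cuts) + [n_frames]
    return [(bounds[k], bounds[k + 1]) for k in range(len(bounds) - 1)]


if __name__ == "__main__":
    # Synthetic clip: 30 dark frames, a 5-frame bright flash, 30 dark frames.
    dark, bright = [10] * 4, [200] * 4
    clip = [dark] * 30 + [bright] * 5 + [dark] * 30
    cuts = find_cuts(clip, threshold=45.0, min_scene_len=20)
    print(scenes_from_cuts(cuts, len(clip)))
```

Raising the threshold makes gradual changes such as pans less likely to count as cuts, while the minimum length suppresses a second cut that follows too soon after the first, which is what removes the short micro-scenes.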
-
geemap
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
-
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
-
pdftabextract
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
-
Python Image processing discussion
Python Image processing related posts
-
remove-ai-watermarks: a CLI that removes SynthID, C2PA, and the Gemini sparkle
-
The creative software industry has declared war on Adobe
-
A Complete Guide to Real-Time GPU Usage Monitoring
-
DeepInverse: PyTorch Library for Inverse Problems
-
Turn Scanned PDFs into OCR'd Text via Email (Supports English and Dutch)
-
Rembg: Remove Image Backgrounds
-
Show HN: Fixing Google Nano Banana Pixel Art with Rust
-
A note from our sponsor - SaaSHub
www.saashub.com | 6 Jun 2026
Index
What are some of the best open-source Image processing projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | supervision | 40,043 |
| 2 | OCRmyPDF | 33,782 |
| 3 | EasyOCR | 29,547 |
| 4 | rembg | 23,186 |
| 5 | LaTeX-OCR | 16,324 |
| 6 | pillow | 13,596 |
| 7 | blind_watermark | 12,940 |
| 8 | segmentation_models.pytorch | 11,596 |
| 9 | Kornia | 11,227 |
| 10 | manga-image-translator | 9,969 |
| 11 | U-2-Net | 9,680 |
| 12 | scikit-image | 6,522 |
| 13 | darkflow | 6,147 |
| 14 | sketch-code | 5,157 |
| 15 | PySceneDetect | 4,892 |
| 16 | geemap | 3,956 |
| 17 | towhee | 3,446 |
| 18 | Crunch | 3,429 |
| 19 | catalyst | 3,378 |
| 20 | SimpleCV | 2,729 |
| 21 | pywt | 2,373 |
| 22 | pdftabextract | 2,258 |
| 23 | pytorch-ssim | 1,947 |