Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues. Learn more →
Top 23 Python Image processing Projects
-
Project mention: 13 GitHub Projects that Supercharge Your AI and Development Journey 🚀 | dev.to | 2025-03-03
Stars: 19899 Author: ocrmypdf Star the OCRmyPDF repository⭐
-
Judoscale
Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
-
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
Project mention: Replacing Only the Background of an Image with AI Generation Using the Stable Diffusion Web API | dev.to | 2024-11-08
Another approach for background removal is to use rembg. This method requires only one API call, making it more cost-effective, though it may result in differences in extraction accuracy.
-
albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Project mention: Albumentations: Fast and flexible image augmentation library | news.ycombinator.com | 2025-02-22 -
Project mention: Show HN: Synthesize TikZ Graphics Programs for Scientific Figures and Sketches | news.ycombinator.com | 2024-06-06
already claim to (at least partially) support this.
[1] https://github.com/lukas-blecher/LaTeX-OCR
-
Project mention: Release Radar · October 2024: Major updates from the open source community | dev.to | 2024-11-01
One problem when it comes to images and coding, is computer programs don't read images in the same way as code (ie. text). Instead, images need to be converted or interpreted. That's where Pillow comes in. It's an image processing library for Python interpreters, with extensive file support. The latest update drops support for EOL Python 3.8, now only supporting Python 3.12 or later. The Pillow C API is now faster, resulting in the removal of PyAccess, and only using PyPy. There's also support for WebP 0.5 as a minimum and internal PSFile class has been removed. Check out all the changes in the Pillow release notes.
-
InfluxDB
InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
-
-
segmentation_models.pytorch
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
-
U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
-
google-images-download
Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!
-
deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Finally, we stored these vectors in our chosen database: the activeloop DeepLake database. This database is open source, something near and dear to our own open-source hearts. We will cover some additional details in a further section, but it is specifically designed to handle vector data and perform efficient similarity searches, which is crucial for quick and accurate retrieval during the RAG process.
-
-
-
-
darkflow
Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices
-
sketch-code
Keras model to generate HTML code from hand-drawn website mockups. Implements an image captioning architecture to drawn source images.
-
PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
thought the same thing and wrote this: https://blog.mixpeek.com/dynamic-video-chunking-scene-detect...
which uses https://github.com/Breakthrough/PySceneDetect
under the hood i'm sure it's the same ffmpeg method ;)
-
geemap
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
-
-
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
-
-
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
Python Image processing discussion
Python Image processing related posts
-
Mistral OCR
-
Ask HN: What is the best method for turning a scanned book as a PDF into text?
-
Liberate tabular data from scanned documents
-
Llama-OCR: An Open-Source Llama 3.2 Based OCR Tool
-
Replacing Only the Background of an Image with AI Generation Using the Stable Diffusion Web API
-
A return to hand-written notes by learning to read and write
-
Ask HN: What's the most creative 'useless' program you've ever written?
-
A note from our sponsor - Judoscale
judoscale.com | 29 Apr 2025
Index
What are some of the best open-source Image processing projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | OCRmyPDF | 28,226 |
2 | supervision | 26,491 |
3 | EasyOCR | 26,463 |
4 | rembg | 18,765 |
5 | albumentations | 14,853 |
6 | LaTeX-OCR | 14,200 |
7 | pillow | 12,735 |
8 | Kornia | 10,419 |
9 | segmentation_models.pytorch | 10,365 |
10 | U-2-Net | 9,038 |
11 | google-images-download | 8,612 |
12 | deeplake | 8,556 |
13 | manga-image-translator | 7,373 |
14 | blind_watermark | 6,366 |
15 | scikit-image | 6,233 |
16 | darkflow | 6,146 |
17 | sketch-code | 5,135 |
18 | PySceneDetect | 3,825 |
19 | geemap | 3,639 |
20 | Crunch | 3,385 |
21 | towhee | 3,359 |
22 | catalyst | 3,336 |
23 | SimpleCV | 2,718 |