Top 23 Python Image processing Projects
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
-
albumentations
Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
-
google-images-download
Python script to download hundreds of images from Google Images. The code is ready to run!
-
U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
-
deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
-
darkflow
Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices
-
image-super-resolution
🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.
-
geemap
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
-
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
-
PySceneDetect
Python and OpenCV-based scene cut/transition detection program & library.
Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27
PyTesseract module, EasyOCR module, PaddlePaddle OCR.
You can always slice the image into smaller tiles, run detection on each tile, and combine the results. Supervision has a utility for this - https://supervision.roboflow.com/latest/detection/tools/infe..., but it only works with detections. You can get a much more accurate result this way. Here is a side-by-side comparison: https://github.com/roboflow/supervision/releases/tag/0.14.0.
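The slice, detect, and combine idea can be sketched in plain Python. This is a minimal illustration under stated assumptions, not Supervision's implementation: `tile_windows`, `detect_tiled`, and `demo_detector` are hypothetical names, the detector is a stand-in callback that receives a tile window and returns boxes in tile-local coordinates, and duplicates from overlapping tiles are dropped with a simple score-free IoU filter.

```python
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max)


def tile_windows(width: int, height: int, tile: int, overlap: int) -> List[Box]:
    """Return overlapping tile windows that together cover the whole image."""
    step = tile - overlap
    if step <= 0:
        raise ValueError("tile must be larger than overlap")
    windows = []
    for y0 in range(0, max(height - overlap, 1), step):
        for x0 in range(0, max(width - overlap, 1), step):
            # Clip the window at the right and bottom image borders.
            windows.append((x0, y0, min(x0 + tile, width), min(y0 + tile, height)))
    return windows


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def detect_tiled(
    width: int,
    height: int,
    detect_tile: Callable[[Box], List[Box]],
    tile: int = 640,
    overlap: int = 64,
    iou_threshold: float = 0.5,
) -> List[Box]:
    """Run a per-tile detector and merge its boxes into image coordinates."""
    kept: List[Box] = []
    for x0, y0, x1, y1 in tile_windows(width, height, tile, overlap):
        for bx0, by0, bx1, by1 in detect_tile((x0, y0, x1, y1)):
            # Shift the tile-local box back into full-image coordinates.
            box = (bx0 + x0, by0 + y0, bx1 + x0, by1 + y0)
            # Keep the box only if it is not a duplicate from an overlapping tile.
            if all(iou(box, other) <= iou_threshold for other in kept):
                kept.append(box)
    return kept


def demo_detector(window: Box) -> List[Box]:
    """Hypothetical detector: 'finds' one fixed object if the tile contains it."""
    obj = (50, 50, 60, 60)  # made-up object location in image coordinates
    x0, y0, x1, y1 = window
    if x0 <= obj[0] and y0 <= obj[1] and obj[2] <= x1 and obj[3] <= y1:
        return [(obj[0] - x0, obj[1] - y0, obj[2] - x0, obj[3] - y0)]
    return []


if __name__ == "__main__":
    print(detect_tiled(100, 100, demo_detector, tile=64, overlap=16))
```

In this toy setup the object falls inside all four overlapping tiles, so it is detected four times and the IoU filter reduces it to a single box. In practice `detect_tile` would crop the image to the window and call a real model, and a score-aware non-maximum suppression would usually replace the simple filter.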
You can use albumentations if you are comfortable using open-source libraries: https://github.com/albumentations-team/albumentations
Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14
Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it is absolutely brilliant.
Project mention: Creating Automatic Subtitles for Videos with Python, Faster-Whisper, FFmpeg, Streamlit, Pillow | dev.to | 2024-04-29
Pillow (https://python-pillow.org/)
Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09
Also, I’d suggest considering switching to the segmentation-models library - it provides U-Net models with a variety of pretrained backbones as encoders. The author also put out a PyTorch version. https://github.com/qubvel/segmentation_models.pytorch https://github.com/qubvel/segmentation_models
Project mention: I used the ChatGPT API to create a proof-of-concept AI driven video game. Using generative AI for the images and dialogue and GPT-3.5 for narrative and game control. More info in comments. | /r/ChatGPT | 2023-06-17
I use a finetuned custom Stable Diffusion model in combination with a style embedding for the characters for image generation and U²-Net for background removal.
We will use the Hugging Face transformers and diffusers libraries for inference, FiftyOne for data management and visualization, and scikit-image for evaluation metrics.
Project mention: [DISC] - The angel who came to pick me up is a Gal (Oneshot by Shiraishi Kouhei) | /r/manga | 2023-09-06
OCR works pretty well. ocr.space, ocr.best and cotrans.touhou.ai/ are all pretty nice.
Project mention: I'm a senior in my CS major and it's incredible I didn't hear about GIS projects until now. Glad to be here. | /r/gis | 2023-05-22
Try out Google Earth Engine and browse through its catalogue to get a feel for what's available. GEE allows you to work with global datasets and immediately see a preview of the results (there's also geemap if you prefer doing this from a Python notebook instead of the online JS editor).
Project mention: VidCutter: A program for lossless video cutting | news.ycombinator.com | 2023-08-20
If you mean scene changes, this library works: https://github.com/Breakthrough/PySceneDetect
Python Image processing related posts
-
Supervision – reusable computer vision tools
-
Rembg: Tool to Remove Images Background
-
Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide
-
Instance segmentation of small objects in grainy drone imagery
-
🚀 Background Removal in Python with PyTorch and Rembg! 🎨🐍
-
Terminal program to change colors
-
Batch-processing images by folder on ComfyUI
-
Index
What are some of the best open-source Image processing projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | EasyOCR | 22,049 |
2 | rembg | 14,628 |
3 | supervision | 14,068 |
4 | albumentations | 13,451 |
5 | OCRmyPDF | 12,067 |
6 | pillow | 11,722 |
7 | LaTeX-OCR | 10,860 |
8 | Kornia | 9,429 |
9 | segmentation_models.pytorch | 8,844 |
10 | google-images-download | 8,499 |
11 | U-2-Net | 8,134 |
12 | deeplake | 7,729 |
13 | darkflow | 6,128 |
14 | scikit-image | 5,880 |
15 | blind_watermark | 5,314 |
16 | image-super-resolution | 4,505 |
17 | manga-image-translator | 4,239 |
18 | Crunch | 3,325 |
19 | catalyst | 3,229 |
20 | geemap | 3,207 |
21 | towhee | 3,001 |
22 | PySceneDetect | 2,812 |
23 | SimpleCV | 2,659 |