Python Image processing

Open-source Python projects categorized as Image processing

Top 23 Python Image processing Projects

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

  • Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27

    PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]

  • rembg

    Rembg is a tool to remove images background

  • Project mention: Rembg: Tool to Remove Images Background | news.ycombinator.com | 2024-03-19
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • supervision

    We write your reusable computer vision tools. 💜

  • Project mention: Supervision: Reusable Computer Vision | news.ycombinator.com | 2024-03-24

    You can always slice the images into smaller ones, run detection on each tile, and combine results. Supervision has a utility for this - https://supervision.roboflow.com/latest/detection/tools/infe..., but it only works with detections. You can get a much more accurate result this way. Here is some side-by-side comparison: https://github.com/roboflow/supervision/releases/tag/0.14.0.

  • albumentations

    Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

  • Project mention: Augment specific classes? | /r/computervision | 2023-12-06

    You can use albumentations if you are comfortable with using open source libraries https://github.com/albumentations-team/albumentations

  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

  • Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14

    Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it absolutely brilliant.

  • pillow

    Python Imaging Library (Fork)

  • Project mention: Creando Subtítulos Automáticos para Vídeos con Python, Faster-Whisper, FFmpeg, Streamlit, Pillow | dev.to | 2024-04-29

    Pillow (https://python-pillow.org/)

  • LaTeX-OCR

    pix2tex: Using a ViT to convert images of equations into LaTeX code.

  • Project mention: Detexify LaTeX Handwriting Symbol Recognition | news.ycombinator.com | 2023-11-14
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • Kornia

    Geometric Computer Vision Library for Spatial AI

  • segmentation_models.pytorch

    Segmentation models with pretrained backbones. PyTorch.

  • Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09

    Also, I’d suggest considering switching to the segmentation-models library - it provides U-Net models with a variety of pretrained backbones of as encoders. The author also put out a PyTorch version. https://github.com/qubvel/segmentation_models.pytorch https://github.com/qubvel/segmentation_models

  • google-images-download

    Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!

  • U-2-Net

    The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

  • Project mention: I used the ChatGPT API to create a proof-of-concept AI driven video game. Using generative AI for the images and dialogue and GPT-3.5 for narrative and game control. More info in comments. | /r/ChatGPT | 2023-06-17

    I use a finetuned custom Stable Diffusion model in combination with a style embedding for the characters for image generation and U²-Net for background removal.

  • deeplake

    Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

  • Project mention: FLaNK AI Weekly 25 March 2025 | dev.to | 2024-03-25
  • darkflow

    Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices

  • scikit-image

    Image processing in Python

  • Project mention: How to Estimate Depth from a Single Image | dev.to | 2024-04-25

    We will use the Hugging Face transformers and diffusers libraries for inference, FiftyOne for data management and visualization, and scikit-image for evaluation metrics.

  • blind_watermark

    Blind&Invisible Watermark ,图片盲水印,提取水印无须原图!

  • image-super-resolution

    🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

  • manga-image-translator

    Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/

  • Project mention: [DISC] - The angel who came to pick me up is a Gal (Oneshot by Shiraishi Kouhei) | /r/manga | 2023-09-06

    OCR works pretty good. ocr.space, ocr.best and cotrans.touhou.ai/ are all pretty nice.

  • Crunch

    Insane(ly slow but wicked good) PNG image optimization (by chrissimpkins)

  • catalyst

    Accelerated deep learning R&D (by catalyst-team)

  • Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09
  • geemap

    A Python package for interactive geospatial analysis and visualization with Google Earth Engine.

  • Project mention: I'm a senior in my CS major and it's incredible I didn't hear about GIS projects until now. Glad to be here. | /r/gis | 2023-05-22

    Try out Google Earth Engine and browse through it's catalogue to get a feel for what's available. GEE allows you to work with global datasets and immediately see a preview of the results (there's also geemap if you prefer doing this from a Python notebook instead of the online JS editor)

  • towhee

    Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

  • Project mention: FLaNK Stack Weekly for 14 Aug 2023 | dev.to | 2023-08-14
  • PySceneDetect

    :movie_camera: Python and OpenCV-based scene cut/transition detection program & library.

  • Project mention: VidCutter: A program for lossless video cutting | news.ycombinator.com | 2023-08-20

    If you mean scene changes, this library works: https://github.com/Breakthrough/PySceneDetect

  • SimpleCV

    The Open Source Framework for Machine Vision

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Image processing related posts

  • Supervision – reusable computer vision tools

    1 project | news.ycombinator.com | 20 Mar 2024
  • Rembg: Tool to Remove Images Background

    1 project | news.ycombinator.com | 19 Mar 2024
  • Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide

    5 projects | dev.to | 27 Dec 2023
  • Instance segmentation of small objects in grainy drone imagery

    8 projects | /r/computervision | 9 Dec 2023
  • 🚀 Background Removal in Python with PyTorch and Rembg! 🎨🐍

    2 projects | /r/programming | 9 Dec 2023
  • Terminal program to change colors

    1 project | /r/dwm | 8 Dec 2023
  • Batch-processing images by folder on ComfyUI

    1 project | /r/StableDiffusion | 8 Dec 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 7 May 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Image processing projects in Python? This list will help you:

Project Stars
1 EasyOCR 22,049
2 rembg 14,628
3 supervision 14,068
4 albumentations 13,451
5 OCRmyPDF 12,067
6 pillow 11,722
7 LaTeX-OCR 10,860
8 Kornia 9,429
9 segmentation_models.pytorch 8,844
10 google-images-download 8,499
11 U-2-Net 8,134
12 deeplake 7,729
13 darkflow 6,128
14 scikit-image 5,880
15 blind_watermark 5,314
16 image-super-resolution 4,505
17 manga-image-translator 4,239
18 Crunch 3,325
19 catalyst 3,229
20 geemap 3,207
21 towhee 3,001
22 PySceneDetect 2,812
23 SimpleCV 2,659

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com