Python Image processing

Open-source Python projects categorized as Image processing

Top 23 Python Image processing Projects

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

    Project mention: I made a website for a friend who owns a restaurant. He's wondering if there's a way to upload a picture of his menu daily. What is the best way to do this? | | 2023-01-15
  • albumentations

    Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: Paper about the library:

    Project mention: Unstable Diffusion bounces back with $19,000 raised in one day, by using Stripe | | 2022-12-23

    I think they should use some data augmentation techniques like I am using for Infinity AI if you wanna see more here. Note that most of these do not work for image generation.

  • Sonar

    Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.

  • pillow

    Python Imaging Library (Fork)

    Project mention: Add Watermarks To PDF, JPG & PNG files with no restrictive licensing | | 2023-01-20

    Pillow -

  • google-images-download

    Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!

  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

    Project mention: Cost aside, what is the most accurate OCR app? | | 2022-12-22

    I don't know what's the best, but I've had good luck with the command line tool ocrmypdf.

  • rembg

    Rembg is a tool to remove images background.

    Project mention: Game creator powered by Stable Diffusion: Generate images, remove backgrounds, and edit levels all in one tool! | | 2022-12-13

    We use rembg with u2net:

  • Kornia

    Open Source Differentiable Computer Vision Library

    Project mention: [News] Kornia 0.6.6: ParametrizedLine API, load_image support for Apple Windows Developer, integration demos with Hugging Face and many more. | | 2022-07-18


  • InfluxDB

    Build time-series-based applications quickly and at scale.. InfluxDB is the Time Series Platform where developers build real-time applications for analytics, IoT and cloud-native services. Easy to start, it is available in the cloud or on-premises.

  • pywal

    🎨 Generate and change color-schemes on the fly.

    Project mention: I created an online tool to generate Vim and Emacs themes from VS Code theme's JSON files. | | 2023-01-18
  • segmentation_models.pytorch

    Segmentation models with pretrained backbones. PyTorch.

    Project mention: Medical Image Segmentation Human Retina | | 2023-01-19

    This basic example from segmentation models PyTorch repo would be good tutorial to start with. The library is very good, I like the unet, fpn and deeplabv3+ architectures with regnety as encoder

  • darkflow

    Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices

    Project mention: Does the Haskell client for Selenium still work? | | 2022-06-29

    You could already tell from my earlier comment, but I don't think Python is a good language, for anything really. It is used for AI so that today's hardware performs like hardware we had twenty years ago: Python code has slow execution speed. It is not always the Selenium which is slow. sometimes we need to look at code we are using. And Python is always the slowest programming language out there in terms of performance. It's also just a stupid language that annoys me and it's the same with other people: With Python, I find it a bit too easy to write sloppy code. Haskell on the other hand really forces you to break the problem done and abstract out reusable code. That's not to say that you can't write nice code in Python, just that Haskell doesn't let you get away with doing a lot of stupid stuff. This is equally important for Selenium.

  • scikit-image

    Image processing in Python

    Project mention: Is it possible to add a noise to an image in python? | | 2022-12-15

    This is a good cv deep learning book with python examples If you're pretty comfortable with the concepts of traditional image processing this is a good companion to cv2 (so you don't have to reinvent the wheel)

  • deeplake

    Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow.

    Project mention: Launch HN: Activeloop (YC S18) – Data lake for deep learning | | 2022-11-15

    Re: HF - we know them and admire their work (primarily, until very recently, focused on NLP, while we focus mostly on CV). As mentioned in the post, a large part of Deep Lake, including the Python-based dataloader and dataset format, is open source as well -

    Likewise, we curate a list of large open source datasets here ->, but our main thing isn't aggregating datasets (focus for HF datasets), but rather providing people with a way to manage their data efficiently. That being said, all of the 125+ public datasets we have are available in seconds with one line of code. :)

    We haven't benchmarked against HF datasets in a while, but Deep Lake's dataloader is much, much faster in third-party benchmarks (see this and here for an older version, that was much slower than what we have now, see this: HF under the hood uses Git-LFS (to the best of my knowledge) and is not opinionated on formats, so LAION just dumps Parquet files on their storage.

    While your setup would work for a few TBs, scaling to PB would be tricky including maintaining your own infrastructure. And yep, as you said NAS/NFS would neither be able to handle the scale (especially writes with 1k workers). I am also slightly curious about your use of mmap files with image/video compressed data (as zero-copy won’t happen) unless you decompress inside the GPU ;), but would love to learn more from you! Re: pricing thanks for the feedback, storage is one component and customly priced for PB-scale workloads.

  • image-super-resolution

    🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

    Project mention: A tech worker is selling a children's book he made using AI. Professional illustrators are pissed. | | 2022-12-15
  • LaTeX-OCR

    pix2tex: Using a ViT to convert images of equations into LaTeX code.

    Project mention: Any alternatives to Mathpix/Latex-OCR? | | 2023-01-24


  • blind_watermark

    Blind&Invisible Watermark ,图片盲水印,提取水印无须原图!

    Project mention: 聊聊墙内截图被添加盲水印的事 以及如何去水印 | | 2022-09-08
  • SimpleCV

    The Open Source Framework for Machine Vision

    Project mention: Python for everyone : Mastering Python The Right Way | | 2022-03-07


  • geemap

    A Python package for interactive mapping with Google Earth Engine, ipyleaflet, and ipywidgets.

    Project mention: I'm building an IDE and open source library to make it easier to work with geospatial data using Python | | 2022-11-04
  • pdftabextract

    A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

  • PySceneDetect

    :movie_camera: Python and OpenCV-based scene cut/transition detection program & library.

  • towhee

    Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

    Project mention: What Is DocArray? | | 2022-10-02

    The description of this is kind of confusing but I think the easiest way to understand it is that it is a data processing pipeline of sorts. Take unstructured data and apply transformation and computation. A similar project to this is Towhee ( This project tries to simplify unstructured data processing and provides pretrained models and pipelines from their hub.

  • pywt

    PyWavelets - Wavelet Transforms in Python

    Project mention: Wavelets: A Mathematical Microscope | | 2022-11-14
  • pytorch-ssim

    pytorch structural similarity (SSIM) loss

  • lightning-bolts

    Toolbox of models, callbacks, and datasets for AI/ML researchers.

    Project mention: Question about implementing RL algorithms | | 2022-11-19

    In the lightning-bolts repository, they implement the different RL algorithms, such as PPO and DQN, as different models. Would it make more sense to have the different algorithms be the Trainer instead? Inside each of the implementations, the model creates the same neural network with different training steps.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-01-24.

Python Image processing related posts


What are some of the best open-source Image processing projects in Python? This list will help you:

Project Stars
1 EasyOCR 16,848
2 albumentations 11,460
3 pillow 10,469
4 google-images-download 8,195
5 OCRmyPDF 8,056
6 rembg 8,004
7 Kornia 7,636
8 pywal 6,955
9 segmentation_models.pytorch 6,694
10 darkflow 6,062
11 scikit-image 5,202
12 deeplake 5,189
13 image-super-resolution 4,010
14 LaTeX-OCR 2,769
15 blind_watermark 2,749
16 SimpleCV 2,594
17 geemap 2,493
18 pdftabextract 2,035
19 PySceneDetect 1,847
20 towhee 1,772
21 pywt 1,588
22 pytorch-ssim 1,562
23 lightning-bolts 1,462
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives