Top 23 Python Image processing Projects
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Project mention: I made a website for a friend who owns a restaurant. He's wondering if there's a way to upload a picture of his menu daily. What is the best way to do this? | reddit.com/r/learnprogramming | 2023-01-15
-
albumentations
Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Project mention: Unstable Diffusion bounces back with $19,000 raised in one day, by using Stripe | reddit.com/r/StableDiffusion | 2022-12-23
I think they should use some data augmentation techniques like I am using for Infinity AI if you wanna see more here. Note that most of these do not work for image generation.
-
Pillow
Project mention: Add Watermarks To PDF, JPG & PNG files with no restrictive licensing | reddit.com/r/Python | 2023-01-20
Pillow - https://github.com/python-pillow/Pillow
-
google-images-download
Python script to download hundreds of images from 'Google Images'. It is ready-to-run code!
-
OCRmyPDF
I don't know what's the best, but I've had good luck with the command-line tool ocrmypdf. https://github.com/ocrmypdf/OCRmyPDF
-
rembg
Project mention: Game creator powered by Stable Diffusion: Generate images, remove backgrounds, and edit levels all in one tool! | reddit.com/r/StableDiffusion | 2022-12-13
We use rembg with u2net: https://github.com/danielgatis/rembg
-
Kornia
Project mention: [News] Kornia 0.6.6: ParametrizedLine API, load_image support for Apple Windows Developer, integration demos with Hugging Face and many more. | reddit.com/r/MachineLearning | 2022-07-18
👉 https://github.com/kornia/kornia/releases/tag/v0.6.6
-
pywal
Project mention: I created an online tool to generate Vim and Emacs themes from VS Code theme's JSON files. | reddit.com/r/vim | 2023-01-18
-
segmentation_models.pytorch
This basic example from the segmentation models PyTorch repo would be a good tutorial to start with. The library is very good; I like the unet, fpn and deeplabv3+ architectures with regnety as encoder: https://github.com/qubvel/segmentation_models.pytorch/blob/master/examples/binary_segmentation_intro.ipynb
-
darkflow
Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices
Project mention: Does the Haskell client for Selenium still work? | reddit.com/r/haskell | 2022-06-29
You could already tell from my earlier comment, but I don't think Python is a good language, for anything really. It is used for AI so that today's hardware performs like hardware we had twenty years ago: https://github.com/thtrieu/darkflow/issues/904 Python code has slow execution speed. It is not always Selenium that is slow; sometimes we need to look at the code we are using. And Python is always the slowest programming language out there in terms of performance. It's also just a stupid language that annoys me and it's the same with other people: https://medium.com/nerd-for-tech/python-is-a-bad-programming-language-2ab73b0bda5 With Python, I find it a bit too easy to write sloppy code. Haskell on the other hand really forces you to break the problem down and abstract out reusable code. That's not to say that you can't write nice code in Python, just that Haskell doesn't let you get away with doing a lot of stupid stuff. This is equally important for Selenium.
-
scikit-image
Project mention: Is it possible to add a noise to an image in python? | reddit.com/r/learnpython | 2022-12-15
This is a good cv deep learning book with python examples https://www.manning.com/books/deep-learning-for-vision-systems. If you're pretty comfortable with the concepts of traditional image processing this is a good companion to cv2 (so you don't have to reinvent the wheel) https://scikit-image.org/
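For the question in the mention above (adding noise to an image), here is a minimal sketch using only NumPy. It assumes the image is already loaded as a `uint8` array; the helper name `add_gaussian_noise` and its `sigma`/`seed` parameters are hypothetical, chosen for illustration. scikit-image also ships `skimage.util.random_noise`, which may be the more convenient route if the library is installed.

```python
import numpy as np


def add_gaussian_noise(image, sigma=25.0, seed=None):
    """Return a noisy copy of a uint8 image (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    # Draw zero-mean Gaussian noise with the same shape as the image.
    noise = rng.normal(0.0, sigma, size=image.shape)
    # Add in floating point so values can leave the 0..255 range temporarily.
    noisy = image.astype(np.float64) + noise
    # Round, clamp back into the valid range, and restore the uint8 dtype.
    return np.clip(np.rint(noisy), 0, 255).astype(np.uint8)


# A flat gray 4x4 RGB image stands in for a real photo.
image = np.full((4, 4, 3), 128, dtype=np.uint8)
noisy = add_gaussian_noise(image, sigma=25.0, seed=0)
```

Doing the addition in floating point and clipping afterwards avoids the wrap-around that would occur if noise were added directly to `uint8` values.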
-
deeplake
Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai
Project mention: Launch HN: Activeloop (YC S18) – Data lake for deep learning | news.ycombinator.com | 2022-11-15
Re: HF - we know them and admire their work (primarily, until very recently, focused on NLP, while we focus mostly on CV). As mentioned in the post, a large part of Deep Lake, including the Python-based dataloader and dataset format, is open source as well - https://github.com/activeloopai/deeplake.
Likewise, we curate a list of large open source datasets here -> https://datasets.activeloop.ai/docs/ml/, but our main thing isn't aggregating datasets (focus for HF datasets), but rather providing people with a way to manage their data efficiently. That being said, all of the 125+ public datasets we have are available in seconds with one line of code. :)
We haven't benchmarked against HF datasets in a while, but Deep Lake's dataloader is much, much faster in third-party benchmarks (see https://arxiv.org/pdf/2209.13705; for an older version that was much slower than what we have now, see https://pasteboard.co/la3DmCUR2iFb.png). HF under the hood uses Git-LFS (to the best of my knowledge) and is not opinionated on formats, so LAION just dumps Parquet files on their storage.
While your setup would work for a few TBs, scaling to PB would be tricky, including maintaining your own infrastructure. And yep, as you said, NAS/NFS would not be able to handle the scale either (especially writes with 1k workers). I am also slightly curious about your use of mmap files with image/video compressed data (as zero-copy won’t happen) unless you decompress inside the GPU ;), but would love to learn more from you! Re: pricing, thanks for the feedback; storage is one component and is custom-priced for PB-scale workloads.
-
image-super-resolution
🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.
Project mention: A tech worker is selling a children's book he made using AI. Professional illustrators are pissed. | reddit.com/r/books | 2022-12-15
-
LaTeX-OCR
-
blind_watermark
-
SimpleCV
-
geemap
A Python package for interactive mapping with Google Earth Engine, ipyleaflet, and ipywidgets.
Project mention: I'm building an IDE and open source library to make it easier to work with geospatial data using Python | reddit.com/r/Python | 2022-11-04
-
pdftabextract
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
-
PySceneDetect
🎥 Python and OpenCV-based scene cut/transition detection program & library.
-
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
The description of this is kind of confusing but I think the easiest way to understand it is that it is a data processing pipeline of sorts. Take unstructured data and apply transformation and computation. A similar project to this is Towhee (https://github.com/towhee-io/towhee). This project tries to simplify unstructured data processing and provides pretrained models and pipelines from their hub.
-
pywt
-
pytorch-ssim
-
lightning-bolts
Project mention: Question about implementing RL algorithms | reddit.com/r/reinforcementlearning | 2022-11-19
In the lightning-bolts repository, they implement the different RL algorithms, such as PPO and DQN, as different models. Would it make more sense to have the different algorithms be the Trainer instead? Inside each of the implementations, the model creates the same neural network with different training steps.
-
Python Image processing related posts
- Primify: Embed any image into a prime number
- GitHub - benhoyt/dhash: Python library to calculate the difference hash (perceptual hash) for a given image, useful for detecting duplicates
- Medical Image Segmentation Human Retina
- I created an online tool to generate Vim and Emacs themes from VS Code theme's JSON files.
- PXL8 V1 out now, dream in pixels (for free)
- Where do you BUY your ebooks from?
- My first rice (Qtile + Gruvbox)
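One of the related posts above mentions a difference hash (perceptual hash) for detecting duplicate images. As a rough illustration of the idea only, and not the API of the linked library, the sketch below hashes a small grayscale grid by comparing each pixel with its right-hand neighbor. It assumes the image has already been converted to grayscale and shrunk to a tiny grid (that step is omitted); the names `dhash_bits` and `hamming_distance` are hypothetical, and the bit convention (1 when the left pixel is darker) is one of several in use.

```python
def dhash_bits(gray_rows):
    """Difference hash of a small grayscale grid, returned as an int.

    gray_rows is a list of rows of brightness values. Each horizontal
    neighbor pair contributes one bit: 1 if the left pixel is darker.
    """
    bits = 0
    for row in gray_rows:
        for left, right in zip(row, row[1:]):
            bits = (bits << 1) | (1 if left < right else 0)
    return bits


def hamming_distance(a, b):
    """Number of differing bits between two hashes."""
    return bin(a ^ b).count("1")


original = [[10, 20, 15], [5, 5, 9]]
# Same picture, uniformly brighter: the gradients are unchanged.
brighter = [[value + 30 for value in row] for row in original]
# A different picture with different gradients.
other = [[20, 10, 15], [9, 5, 5]]

same = hamming_distance(dhash_bits(original), dhash_bits(brighter))   # 0
diff = hamming_distance(dhash_bits(original), dhash_bits(other))      # 3
```

Because only the direction of each brightness change is recorded, uniform brightness shifts leave the hash untouched, and a small Hamming distance between two hashes suggests near-duplicate images.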
-
A note from our sponsor - Sonar
www.sonarsource.com | 28 Jan 2023
Index
What are some of the best open-source Image processing projects in Python? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | EasyOCR | 16,848 |
2 | albumentations | 11,460 |
3 | pillow | 10,469 |
4 | google-images-download | 8,195 |
5 | OCRmyPDF | 8,056 |
6 | rembg | 8,004 |
7 | Kornia | 7,636 |
8 | pywal | 6,955 |
9 | segmentation_models.pytorch | 6,694 |
10 | darkflow | 6,062 |
11 | scikit-image | 5,202 |
12 | deeplake | 5,189 |
13 | image-super-resolution | 4,010 |
14 | LaTeX-OCR | 2,769 |
15 | blind_watermark | 2,749 |
16 | SimpleCV | 2,594 |
17 | geemap | 2,493 |
18 | pdftabextract | 2,035 |
19 | PySceneDetect | 1,847 |
20 | towhee | 1,772 |
21 | pywt | 1,588 |
22 | pytorch-ssim | 1,562 |
23 | lightning-bolts | 1,462 |