Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 23 Image processing Open-Source Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
sharp
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
albumentations
Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
-
glide-transformations
An Android transformation library providing a variety of image transformations for Glide.
-
google-images-download
Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!
-
U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: การจำแนกสายพันธุ์มะม่วง โดยใช้ Visual Geometry Group 16 (VGG16) ใน Python | dev.to | 2024-04-16Referenceshttps https://www.kaggle.com/datasets/riyaelizashaju/skin-disease-image-dataset-balanced?fbclid=IwAR3wbTp8l5yo_5fx6HAX8Vd2-9cca3khAc8EiBGFObaALfdVid29IuB_rYE https://keras.io/api/applications/vgg/ https://www.tensorflow.org/tutorials/images/cnn?hl=th https://opencv.org/
We are going to be using an OCR (Optical Character Recognition) engine called Tesseract for the image-to-text recognition part. It is free software, released under the Apache License. Install the engine for your desired OS from their official website. I'm using Windows for this. Add the installation path to your environment variables.
Project mention: Next.js and Bunny CDN: Complete Guide to Image Uploading with Server Actions | dev.to | 2024-04-21Last thing left is to use our new upload function in our server action. Since I like to upload images in single format and have some more control over them, I will additionally use sharp library. For file name, I'll generate some random string using nanoid:
Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
Here's another handy tool that I use: https://squoosh.app/
Project mention: Can anyone suggest PHP, JavaScript File Manager tool with Crop tool integrated? | /r/PHP | 2023-05-22Have a look at https://pqina.nl/filepond/
You can always slice the images into smaller ones, run detection on each tile, and combine results. Supervision has a utility for this - https://supervision.roboflow.com/latest/detection/tools/infe..., but it only works with detections. You can get a much more accurate result this way. Here is some side-by-side comparison: https://github.com/roboflow/supervision/releases/tag/0.14.0.
You can use albumentations if you are comfortable with using open source libraries https://github.com/albumentations-team/albumentations
We chose to use govips which is a cgo wrapper around the libvips image manipulation library. The majority of new development for services in our backend is written using baseplate.go. But Go is not an ideal choice for media processing as it cannot keep up with the performance of native code. The most widely used image-processing libraries like libmagick are primarily written in C or C++. Speed was a major factor in selecting libvips in order to keep latency low on CDN cache misses for images. In our tests, libvips was 3–4 times faster than libmagick on basic image processing operations. Content-aware smart cropping was implemented by porting smartcrop.js to Go. This is the only operation implemented in pure Go.
We use the browser media devices API to bring up the screen selection dialog. After this, we handle the selection of a portion of interest using CropperJs.
Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it absolutely brilliant.
Project mention: Creando Subtítulos Automáticos para Vídeos con Python, Faster-Whisper, FFmpeg, Streamlit, Pillow | dev.to | 2024-04-29Pillow (https://python-pillow.org/)
After some research, I found libvips, a demand-driven, horizontally threaded image processing library. It is designed to run quickly while using as little as memory as possible.
Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09Also, I’d suggest considering switching to the segmentation-models library - it provides U-Net models with a variety of pretrained backbones of as encoders. The author also put out a PyTorch version. https://github.com/qubvel/segmentation_models.pytorch https://github.com/qubvel/segmentation_models
We use imgproxy which does 2->4 and all you need to do is change the URL on your site.
Project mention: I used the ChatGPT API to create a proof-of-concept AI driven video game. Using generative AI for the images and dialogue and GPT-3.5 for narrative and game control. More info in comments. | /r/ChatGPT | 2023-06-17I use a finetuned custom Stable Diffusion model in combination with a style embedding for the characters for image generation and U²-Net for background removal.
Image processing related posts
-
Highlighting Image Text
-
Vision AI agents for any task
-
SVG Viewer – View, edit, and optimize SVGs
-
การจำแนกสายพันธุ์มะม่วง โดยใช้ Visual Geometry Group 16 (VGG16) ใน Python
-
Jpegli: A New JPEG Coding Library
-
Show HN: OS Image processing API running on edge functions using Rust and WASM
-
Exploring the Top SERP (Search Engine Result Pages) APIs
-
A note from our sponsor - InfluxDB
www.influxdata.com | 1 May 2024
Index
What are some of the best open-source Image processing projects? This list will help you:
Project | Stars | |
---|---|---|
1 | OpenCV | 75,566 |
2 | tesseract-ocr | 58,022 |
3 | sharp | 27,943 |
4 | EasyOCR | 21,953 |
5 | squoosh | 20,944 |
6 | CVPR2024-Papers-with-Code | 16,155 |
7 | filepond | 14,663 |
8 | rembg | 14,536 |
9 | supervision | 13,984 |
10 | albumentations | 13,425 |
11 | smartcrop.js | 12,786 |
12 | cropperjs | 12,658 |
13 | OCRmyPDF | 12,002 |
14 | pillow | 11,702 |
15 | LaTeX-OCR | 10,770 |
16 | caire | 10,321 |
17 | glide-transformations | 9,854 |
18 | Kornia | 9,395 |
19 | libvips | 9,005 |
20 | segmentation_models.pytorch | 8,844 |
21 | google-images-download | 8,498 |
22 | imgproxy | 8,240 |
23 | U-2-Net | 8,115 |
Sponsored