Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 23 Image processing Open-Source Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
sharp
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
albumentations
Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
-
glide-transformations
An Android transformation library providing a variety of image transformations for Glide.
-
google-images-download
Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!
-
U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: การจำแนกสายพันธุ์มะม่วง โดยใช้ Visual Geometry Group 16 (VGG16) ใน Python | dev.to | 2024-04-16Referenceshttps https://www.kaggle.com/datasets/riyaelizashaju/skin-disease-image-dataset-balanced?fbclid=IwAR3wbTp8l5yo_5fx6HAX8Vd2-9cca3khAc8EiBGFObaALfdVid29IuB_rYE https://keras.io/api/applications/vgg/ https://www.tensorflow.org/tutorials/images/cnn?hl=th https://opencv.org/
We are going to be using an OCR (Optical Character Recognition) engine called Tesseract for the image-to-text recognition part. It is free software, released under the Apache License. Install the engine for your desired OS from their official website. I'm using Windows for this. Add the installation path to your environment variables.
When sharing content on social media platforms, it's essential to have visually appealing images that are properly sized. Let’s explore how we could automatically resize images for Open Graph and Twitter card previews. We’ll be using sharp - a powerful and fast tool that powers the Image component from Next.js.
Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
Here's another handy tool that I use: https://squoosh.app/
Project mention: Can anyone suggest PHP, JavaScript File Manager tool with Crop tool integrated? | /r/PHP | 2023-05-22Have a look at https://pqina.nl/filepond/
You can always slice the images into smaller ones, run detection on each tile, and combine results. Supervision has a utility for this - https://supervision.roboflow.com/latest/detection/tools/infe..., but it only works with detections. You can get a much more accurate result this way. Here is some side-by-side comparison: https://github.com/roboflow/supervision/releases/tag/0.14.0.
You can use albumentations if you are comfortable with using open source libraries https://github.com/albumentations-team/albumentations
We chose to use govips which is a cgo wrapper around the libvips image manipulation library. The majority of new development for services in our backend is written using baseplate.go. But Go is not an ideal choice for media processing as it cannot keep up with the performance of native code. The most widely used image-processing libraries like libmagick are primarily written in C or C++. Speed was a major factor in selecting libvips in order to keep latency low on CDN cache misses for images. In our tests, libvips was 3–4 times faster than libmagick on basic image processing operations. Content-aware smart cropping was implemented by porting smartcrop.js to Go. This is the only operation implemented in pure Go.
We use the browser media devices API to bring up the screen selection dialog. After this, we handle the selection of a portion of interest using CropperJs.
Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it absolutely brilliant.
Project mention: Creando Subtítulos Automáticos para Vídeos con Python, Faster-Whisper, FFmpeg, Streamlit, Pillow | dev.to | 2024-04-29Pillow (https://python-pillow.org/)
Read through the comments and was surprised no one mentioned libvips - https://github.com/libvips/libvips. At my current small company we were trying to allow image uploads and started with imagemagick but certain images took too long to process and we were looking for faster alternatives. It's a great tool with minimum overhead. For video thumbnails, we use ffmpeg which is really heavy. We off-load video thumbnail generation to a queue. We've had great luck with these tools.
Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09Also, I’d suggest considering switching to the segmentation-models library - it provides U-Net models with a variety of pretrained backbones of as encoders. The author also put out a PyTorch version. https://github.com/qubvel/segmentation_models.pytorch https://github.com/qubvel/segmentation_models
In my project[1], I convert all user-uploaded images to high-quality webp and store them like that. I discard the original files after the conversion. I use imgproxy[2] to further resize and convert them on the fly for actual display.
I don't do videos yet, but I'm kinda terrified of the idea of putting user-uploaded files through ffmpeg if/when I'll support them.
[1] https://github.com/grishka/Smithereen
[2] https://github.com/imgproxy/imgproxy
Project mention: I used the ChatGPT API to create a proof-of-concept AI driven video game. Using generative AI for the images and dialogue and GPT-3.5 for narrative and game control. More info in comments. | /r/ChatGPT | 2023-06-17I use a finetuned custom Stable Diffusion model in combination with a style embedding for the characters for image generation and U²-Net for background removal.
Image processing related posts
-
Ask HN: How to handle user file uploads?
-
Highlighting Image Text
-
Vision AI agents for any task
-
SVG Viewer – View, edit, and optimize SVGs
-
การจำแนกสายพันธุ์มะม่วง โดยใช้ Visual Geometry Group 16 (VGG16) ใน Python
-
Jpegli: A New JPEG Coding Library
-
Show HN: OS Image processing API running on edge functions using Rust and WASM
-
A note from our sponsor - InfluxDB
www.influxdata.com | 8 May 2024
Index
What are some of the best open-source Image processing projects? This list will help you:
Project | Stars | |
---|---|---|
1 | OpenCV | 75,692 |
2 | tesseract-ocr | 58,182 |
3 | sharp | 27,987 |
4 | EasyOCR | 22,049 |
5 | squoosh | 20,981 |
6 | CVPR2024-Papers-with-Code | 16,227 |
7 | filepond | 14,677 |
8 | rembg | 14,628 |
9 | supervision | 14,068 |
10 | albumentations | 13,451 |
11 | smartcrop.js | 12,784 |
12 | cropperjs | 12,672 |
13 | OCRmyPDF | 12,067 |
14 | pillow | 11,722 |
15 | LaTeX-OCR | 10,860 |
16 | caire | 10,322 |
17 | glide-transformations | 9,854 |
18 | Kornia | 9,429 |
19 | libvips | 9,029 |
20 | segmentation_models.pytorch | 8,862 |
21 | google-images-download | 8,499 |
22 | imgproxy | 8,263 |
23 | U-2-Net | 8,134 |
Sponsored