Top 23 Image processing Open-Source Projects

OpenCV

196 75,566 9.9 C++

Open Source Computer Vision Library

Project mention: การจำแนกสายพันธุ์มะม่วง โดยใช้ Visual Geometry Group 16 (VGG16) ใน Python | dev.to | 2024-04-16

Referenceshttps https://www.kaggle.com/datasets/riyaelizashaju/skin-disease-image-dataset-balanced?fbclid=IwAR3wbTp8l5yo_5fx6HAX8Vd2-9cca3khAc8EiBGFObaALfdVid29IuB_rYE https://keras.io/api/applications/vgg/ https://www.tensorflow.org/tutorials/images/cnn?hl=th https://opencv.org/

tesseract-ocr

121 58,022 8.9 C++

Tesseract Open Source OCR Engine (main repository)

Project mention: Highlighting Image Text | dev.to | 2024-04-30

We are going to be using an OCR (Optical Character Recognition) engine called Tesseract for the image-to-text recognition part. It is free software, released under the Apache License. Install the engine for your desired OS from their official website. I'm using Windows for this. Add the installation path to your environment variables.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
sharp

97 27,943 9.4 JavaScript

High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.

Project mention: Next.js and Bunny CDN: Complete Guide to Image Uploading with Server Actions | dev.to | 2024-04-21

Last thing left is to use our new upload function in our server action. Since I like to upload images in single format and have some more control over them, I will additionally use sharp library. For file name, I'll generate some random string using nanoid:

EasyOCR

38 21,953 3.6 Python

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27

PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]

squoosh

267 20,944 5.8 TypeScript

Make images smaller using best-in-class codecs, right in the browser.

Project mention: SVG Viewer – View, edit, and optimize SVGs | news.ycombinator.com | 2024-04-27

Here's another handy tool that I use: https://squoosh.app/

CVPR2024-Papers-with-Code

1 16,155 6.5

CVPR 2024 论文和开源项目合集
filepond

14 14,663 7.1 JavaScript

🌊 A flexible and fun JavaScript file upload library

Project mention: Can anyone suggest PHP, JavaScript File Manager tool with Crop tool integrated? | /r/PHP | 2023-05-22

Have a look at https://pqina.nl/filepond/

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
rembg

52 14,536 7.9 Python

Rembg is a tool to remove images background

Project mention: Rembg: Tool to Remove Images Background | news.ycombinator.com | 2024-03-19

supervision

15 13,984 9.9 Python

We write your reusable computer vision tools. 💜

Project mention: Supervision: Reusable Computer Vision | news.ycombinator.com | 2024-03-24

You can always slice the images into smaller ones, run detection on each tile, and combine results. Supervision has a utility for this - https://supervision.roboflow.com/latest/detection/tools/infe..., but it only works with detections. You can get a much more accurate result this way. Here is some side-by-side comparison: https://github.com/roboflow/supervision/releases/tag/0.14.0.

albumentations

28 13,425 8.9 Python

Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Project mention: Augment specific classes? | /r/computervision | 2023-12-06

You can use albumentations if you are comfortable with using open source libraries https://github.com/albumentations-team/albumentations

smartcrop.js

2 12,786 4.1 JavaScript

Content aware image cropping

Project mention: Just In Time Image Optimization at Reddit Scale | /r/RedditEng | 2023-06-28

We chose to use govips which is a cgo wrapper around the libvips image manipulation library. The majority of new development for services in our backend is written using baseplate.go. But Go is not an ideal choice for media processing as it cannot keep up with the performance of native code. The most widely used image-processing libraries like libmagick are primarily written in C or C++. Speed was a major factor in selecting libvips in order to keep latency low on CDN cache misses for images. In our tests, libvips was 3–4 times faster than libmagick on basic image processing operations. Content-aware smart cropping was implemented by porting smartcrop.js to Go. This is the only operation implemented in pure Go.

cropperjs

8 12,658 5.8 JavaScript

JavaScript image cropper.

Project mention: How to Implement Partial Screenshare | dev.to | 2023-11-09

We use the browser media devices API to bring up the screen selection dialog. After this, we handle the selection of a portion of interest using CropperJs.

OCRmyPDF

77 12,002 9.5 Python

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14

Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it absolutely brilliant.

pillow

44 11,702 9.9 Python

Python Imaging Library (Fork)

Project mention: Creando Subtítulos Automáticos para Vídeos con Python, Faster-Whisper, FFmpeg, Streamlit, Pillow | dev.to | 2024-04-29

Pillow (https://python-pillow.org/)

LaTeX-OCR

21 10,770 3.6 Python

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Project mention: Detexify LaTeX Handwriting Symbol Recognition | news.ycombinator.com | 2023-11-14

caire

3 10,321 5.1 Go

Content aware image resize library
glide-transformations

0 9,854 0.0 Java

An Android transformation library providing a variety of image transformations for Glide.
Kornia

11 9,395 9.4 Python

Geometric Computer Vision Library for Spatial AI
libvips

23 9,005 9.2 C

A fast image processing library with low memory needs.

Project mention: Building an online image compressor | dev.to | 2024-01-09

After some research, I found libvips, a demand-driven, horizontally threaded image processing library. It is designed to run quickly while using as little as memory as possible.

segmentation_models.pytorch

14 8,844 4.1 Python

Segmentation models with pretrained backbones. PyTorch.

Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09

Also, I’d suggest considering switching to the segmentation-models library - it provides U-Net models with a variety of pretrained backbones of as encoders. The author also put out a PyTorch version. https://github.com/qubvel/segmentation_models.pytorch https://github.com/qubvel/segmentation_models

google-images-download

2 8,498 0.0 Python

Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!
imgproxy

28 8,240 9.3 Go

Fast and secure standalone server for resizing and converting remote images

Project mention: How Can I Streamline My Image Prep | /r/webdev | 2023-12-06

We use imgproxy which does 2->4 and all you need to do is change the URL on your site.

U-2-Net

30 8,115 3.1 Python

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Project mention: I used the ChatGPT API to create a proof-of-concept AI driven video game. Using generative AI for the images and dialogue and GPT-3.5 for narrative and game control. More info in comments. | /r/ChatGPT | 2023-06-17

I use a finetuned custom Stable Diffusion model in combination with a style embedding for the characters for image generation and U²-Net for background removal.

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Image processing related posts

Highlighting Image Text

1 project | dev.to | 30 Apr 2024
Vision AI agents for any task

1 project | dev.to | 30 Apr 2024
SVG Viewer – View, edit, and optimize SVGs

4 projects | news.ycombinator.com | 27 Apr 2024
การจำแนกสายพันธุ์มะม่วง โดยใช้ Visual Geometry Group 16 (VGG16) ใน Python

1 project | dev.to | 16 Apr 2024
Jpegli: A New JPEG Coding Library

9 projects | news.ycombinator.com | 3 Apr 2024
Show HN: OS Image processing API running on edge functions using Rust and WASM

3 projects | news.ycombinator.com | 4 Apr 2024
Exploring the Top SERP (Search Engine Result Pages) APIs

1 project | dev.to | 3 Apr 2024
A note from our sponsor - InfluxDB
www.influxdata.com | 1 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source Image processing projects? This list will help you:

	Project	Stars
1	OpenCV	75,566
2	tesseract-ocr	58,022
3	sharp	27,943
4	EasyOCR	21,953
5	squoosh	20,944
6	CVPR2024-Papers-with-Code	16,155
7	filepond	14,663
8	rembg	14,536
9	supervision	13,984
10	albumentations	13,425
11	smartcrop.js	12,786
12	cropperjs	12,658
13	OCRmyPDF	12,002
14	pillow	11,702
15	LaTeX-OCR	10,770
16	caire	10,321
17	glide-transformations	9,854
18	Kornia	9,395
19	libvips	9,005
20	segmentation_models.pytorch	8,844
21	google-images-download	8,498
22	imgproxy	8,240
23	U-2-Net	8,115

Image processing

Top 23 Image processing Open-Source Projects

Image processing related posts

Highlighting Image Text

Vision AI agents for any task

SVG Viewer – View, edit, and optimize SVGs

การจำแนกสายพันธุ์มะม่วง โดยใช้ Visual Geometry Group 16 (VGG16) ใน Python

Jpegli: A New JPEG Coding Library

Show HN: OS Image processing API running on edge functions using Rust and WASM

Exploring the Top SERP (Search Engine Result Pages) APIs

Index