Top 23 Image processing Open-Source Projects

OpenCV

196 75,692 9.9 C++

Open Source Computer Vision Library

Project mention: Mango variety classification using Visual Geometry Group 16 (VGG16) in Python | dev.to | 2024-04-16

References: https://www.kaggle.com/datasets/riyaelizashaju/skin-disease-image-dataset-balanced?fbclid=IwAR3wbTp8l5yo_5fx6HAX8Vd2-9cca3khAc8EiBGFObaALfdVid29IuB_rYE https://keras.io/api/applications/vgg/ https://www.tensorflow.org/tutorials/images/cnn?hl=th https://opencv.org/

tesseract-ocr

121 58,182 8.9 C++

Tesseract Open Source OCR Engine (main repository)

Project mention: Highlighting Image Text | dev.to | 2024-04-30

We are going to be using an OCR (Optical Character Recognition) engine called Tesseract for the image-to-text recognition part. It is free software, released under the Apache License. Install the engine for your desired OS from their official website. I'm using Windows for this. Add the installation path to your environment variables.
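The install-and-PATH step above can be checked from a script. This is a minimal sketch, assuming the installed executable is named `tesseract` and is reachable through PATH; the helper names `find_tesseract`, `build_ocr_command`, and `image_to_text` are hypothetical. It relies on the command-line form `tesseract <image> stdout -l <lang>`, which prints the recognized text to standard output.

```python
import shutil
import subprocess
from typing import List, Optional


def find_tesseract() -> Optional[str]:
    """Return the full path of the tesseract executable, or None if it is not on PATH."""
    return shutil.which("tesseract")


def build_ocr_command(image_path: str, lang: str = "eng") -> List[str]:
    """Build the command line that prints the recognized text to standard output."""
    return ["tesseract", image_path, "stdout", "-l", lang]


def image_to_text(image_path: str, lang: str = "eng") -> str:
    """Run Tesseract on one image and return the recognized text."""
    if find_tesseract() is None:
        raise RuntimeError("tesseract not found; add its install folder to PATH")
    result = subprocess.run(
        build_ocr_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if tesseract exits with an error
    )
    return result.stdout


if __name__ == "__main__":
    print(find_tesseract() or "tesseract is not on PATH")
```

If `find_tesseract()` returns `None` right after installation, the environment variable most likely has not been picked up yet; opening a new terminal usually resolves that.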

sharp

98 27,987 9.4 JavaScript

High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.

Project mention: How to resize images for Open Graph and Twitter using sharp | dev.to | 2024-05-08

When sharing content on social media platforms, it's essential to have visually appealing images that are properly sized. Let’s explore how we could automatically resize images for Open Graph and Twitter card previews. We’ll be using sharp - a powerful and fast tool that powers the Image component from Next.js.
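The geometry behind this kind of resize can be sketched without any image library. The snippet below is an illustration only, not the article's code and not sharp's API: it computes the centered crop region that matches a preview's aspect ratio, which is the idea behind a "cover" style fit. The target sizes (1200×630 for Open Graph, 1200×600 for Twitter) are assumed values, and `cover_crop_box` is a hypothetical helper name.

```python
from typing import Dict, Tuple

# Assumed target sizes in pixels (width, height); not taken from the article.
TARGETS: Dict[str, Tuple[int, int]] = {
    "open_graph": (1200, 630),
    "twitter": (1200, 600),
}


def cover_crop_box(src_w: int, src_h: int, dst_w: int, dst_h: int) -> Tuple[int, int, int, int]:
    """Return (left, top, width, height) of the largest centered region of the
    source that has the same aspect ratio as the destination."""
    # Compare aspect ratios by cross-multiplying to stay in integer arithmetic.
    if src_w * dst_h > src_h * dst_w:
        # Source is wider than the target: keep the full height, trim the sides.
        crop_h = src_h
        crop_w = src_h * dst_w // dst_h
    else:
        # Source is taller (or the same shape): keep the full width, trim top and bottom.
        crop_w = src_w
        crop_h = src_w * dst_h // dst_w
    left = (src_w - crop_w) // 2
    top = (src_h - crop_h) // 2
    return left, top, crop_w, crop_h


if __name__ == "__main__":
    for name, (w, h) in TARGETS.items():
        print(name, cover_crop_box(4000, 3000, w, h))
```

For a 4000×3000 photo and the assumed Open Graph size, the crop region is `(0, 450, 4000, 2100)`; scaling that region down to 1200×630 then preserves the aspect ratio.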

EasyOCR

38 22,049 3.6 Python

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27

PyTesseract Module [GitHub], EasyOCR Module [GitHub], PaddlePaddle OCR [GitHub]

squoosh

267 20,981 5.8 TypeScript

Make images smaller using best-in-class codecs, right in the browser.

Project mention: SVG Viewer – View, edit, and optimize SVGs | news.ycombinator.com | 2024-04-27

Here's another handy tool that I use: https://squoosh.app/

CVPR2024-Papers-with-Code

1 16,227 6.5

A collection of CVPR 2024 papers and open-source projects

filepond

14 14,677 7.1 JavaScript

🌊 A flexible and fun JavaScript file upload library

Project mention: Can anyone suggest PHP, JavaScript File Manager tool with Crop tool integrated? | /r/PHP | 2023-05-22

Have a look at https://pqina.nl/filepond/

rembg

52 14,628 7.9 Python

Rembg is a tool to remove image backgrounds

Project mention: Rembg: Tool to Remove Images Background | news.ycombinator.com | 2024-03-19

supervision

15 14,068 9.9 Python

We write your reusable computer vision tools. 💜

Project mention: Supervision: Reusable Computer Vision | news.ycombinator.com | 2024-03-24

You can always slice the images into smaller ones, run detection on each tile, and combine the results. You can get a much more accurate result this way. Supervision has a utility for this - https://supervision.roboflow.com/latest/detection/tools/infe..., but it only works with detections. Here is a side-by-side comparison: https://github.com/roboflow/supervision/releases/tag/0.14.0.
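The slice, detect, and combine idea can be sketched in plain Python. This is not Supervision's API; it is a minimal illustration under stated assumptions: boxes are `(x1, y1, x2, y2)` pixel tuples, `detect` is a hypothetical callback that receives a tile region and returns boxes in tile-local coordinates, and duplicate detections in overlapping areas are not merged (a real pipeline would typically add something like non-maximum suppression).

```python
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]  # (x1, y1, x2, y2) in pixels


def _starts(length: int, tile: int, stride: int) -> List[int]:
    """Start offsets along one axis so that the tiles cover [0, length)."""
    if length <= tile:
        return [0]
    starts = list(range(0, length - tile, stride))
    starts.append(length - tile)  # the last tile sits flush with the far edge
    return starts


def tile_origins(width: int, height: int, tile: int, overlap: int = 0) -> List[Tuple[int, int]]:
    """Top-left corners of square tiles covering the image, row by row."""
    if not 0 <= overlap < tile:
        raise ValueError("overlap must satisfy 0 <= overlap < tile")
    stride = tile - overlap
    return [(x, y)
            for y in _starts(height, tile, stride)
            for x in _starts(width, tile, stride)]


def detect_tiled(width: int, height: int, tile: int, overlap: int,
                 detect: Callable[[int, int, int, int], List[Box]]) -> List[Box]:
    """Run `detect` on every tile and shift its boxes into full-image coordinates."""
    merged: List[Box] = []
    for x0, y0 in tile_origins(width, height, tile, overlap):
        w = min(tile, width - x0)
        h = min(tile, height - y0)
        for x1, y1, x2, y2 in detect(x0, y0, w, h):
            merged.append((x1 + x0, y1 + y0, x2 + x0, y2 + y0))
    return merged


if __name__ == "__main__":
    # Stand-in detector: reports one small box near the corner of every tile.
    fake = lambda x0, y0, w, h: [(1, 2, 3, 4)]
    print(detect_tiled(100, 40, 40, 10, fake))
```

Some overlap between tiles helps objects that straddle a tile border appear whole in at least one tile, at the cost of detecting some objects twice.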

albumentations

28 13,451 8.9 Python

Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Project mention: Augment specific classes? | /r/computervision | 2023-12-06

You can use albumentations if you are comfortable with using open source libraries https://github.com/albumentations-team/albumentations

smartcrop.js

2 12,784 4.1 JavaScript

Content aware image cropping

Project mention: Just In Time Image Optimization at Reddit Scale | /r/RedditEng | 2023-06-28

We chose to use govips which is a cgo wrapper around the libvips image manipulation library. The majority of new development for services in our backend is written using baseplate.go. But Go is not an ideal choice for media processing as it cannot keep up with the performance of native code. The most widely used image-processing libraries like libmagick are primarily written in C or C++. Speed was a major factor in selecting libvips in order to keep latency low on CDN cache misses for images. In our tests, libvips was 3–4 times faster than libmagick on basic image processing operations. Content-aware smart cropping was implemented by porting smartcrop.js to Go. This is the only operation implemented in pure Go.

cropperjs

8 12,672 5.8 JavaScript

JavaScript image cropper.

Project mention: How to Implement Partial Screenshare | dev.to | 2023-11-09

We use the browser media devices API to bring up the screen selection dialog. After this, we handle the selection of a portion of interest using CropperJs.

OCRmyPDF

77 12,067 9.5 Python

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14

Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it is absolutely brilliant.

pillow

44 11,722 9.9 Python

Python Imaging Library (Fork)

Project mention: Creating Automatic Subtitles for Videos with Python, Faster-Whisper, FFmpeg, Streamlit, Pillow | dev.to | 2024-04-29

Pillow (https://python-pillow.org/)

LaTeX-OCR

21 10,860 3.6 Python

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Project mention: Detexify LaTeX Handwriting Symbol Recognition | news.ycombinator.com | 2023-11-14

caire

3 10,322 5.1 Go

Content aware image resize library

glide-transformations

0 9,854 0.0 Java

An Android transformation library providing a variety of image transformations for Glide.

Kornia

11 9,429 9.4 Python

Geometric Computer Vision Library for Spatial AI

libvips

24 9,029 9.2 C

A fast image processing library with low memory needs.

Project mention: Ask HN: How to handle user file uploads? | news.ycombinator.com | 2024-05-03

Read through the comments and was surprised no one mentioned libvips - https://github.com/libvips/libvips. At my current small company we were trying to allow image uploads and started with imagemagick but certain images took too long to process and we were looking for faster alternatives. It's a great tool with minimum overhead. For video thumbnails, we use ffmpeg which is really heavy. We off-load video thumbnail generation to a queue. We've had great luck with these tools.
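Off-loading thumbnail generation to a queue can be sketched with the standard library. This is an illustration under assumptions, not the commenter's setup: an in-process `queue.Queue` with one worker thread stands in for whatever job queue is actually used, the helper names are hypothetical, and the ffmpeg arguments (`-ss` to seek, `-frames:v 1` to write a single frame) are one common way to grab a still image.

```python
import queue
import threading
from typing import Callable, List


def thumbnail_command(video_path: str, thumb_path: str, at_seconds: float = 1.0) -> List[str]:
    """Build an ffmpeg command that writes a single frame of the video as an image."""
    return ["ffmpeg", "-y", "-ss", str(at_seconds), "-i", video_path,
            "-frames:v", "1", thumb_path]


def start_worker(jobs: "queue.Queue", run: Callable[[List[str]], object]) -> threading.Thread:
    """Start a background thread that processes queued thumbnail jobs one at a time."""
    def loop() -> None:
        while True:
            job = jobs.get()
            try:
                if job is None:  # sentinel value: stop the worker
                    return
                video_path, thumb_path = job
                run(thumbnail_command(video_path, thumb_path))
            finally:
                jobs.task_done()

    worker = threading.Thread(target=loop, daemon=True)
    worker.start()
    return worker


if __name__ == "__main__":
    jobs: "queue.Queue" = queue.Queue()
    # Real use would pass a function that calls subprocess.run(cmd, check=True);
    # print is used here so the sketch runs without ffmpeg installed.
    start_worker(jobs, run=print)
    jobs.put(("upload.mp4", "upload.jpg"))  # returns immediately; the work happens later
    jobs.put(None)
    jobs.join()
```

The upload handler only calls `jobs.put(...)`, so a slow ffmpeg run does not hold up the request that accepted the file.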

segmentation_models.pytorch

14 8,862 4.1 Python

Segmentation models with pretrained backbones. PyTorch.

Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09

Also, I’d suggest considering switching to the segmentation-models library - it provides U-Net models with a variety of pretrained backbones as encoders. The author also put out a PyTorch version. https://github.com/qubvel/segmentation_models.pytorch https://github.com/qubvel/segmentation_models

google-images-download

2 8,499 0.0 Python

Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!

imgproxy

29 8,263 9.3 Go

Fast and secure standalone server for resizing and converting remote images

Project mention: Ask HN: How to handle user file uploads? | news.ycombinator.com | 2024-05-03

In my project[1], I convert all user-uploaded images to high-quality webp and store them like that. I discard the original files after the conversion. I use imgproxy[2] to further resize and convert them on the fly for actual display.
I don't do videos yet, but I'm kinda terrified of the idea of putting user-uploaded files through ffmpeg if/when I'll support them.
[1] https://github.com/grishka/Smithereen
[2] https://github.com/imgproxy/imgproxy
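On-the-fly resizing with imgproxy is driven by the request URL. The sketch below builds a signed URL path, assuming the signing scheme described in imgproxy's documentation (HMAC-SHA256 over salt plus path, encoded as URL-safe Base64 without padding) and the `rs:fit:<width>:<height>` resize option. The key, salt, source URL, and the helper name `imgproxy_path` are made up for illustration; check the details against the imgproxy version you deploy.

```python
import base64
import hashlib
import hmac


def _b64url(data: bytes) -> str:
    """URL-safe Base64 without trailing '=' padding."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def imgproxy_path(source_url: str, width: int, height: int,
                  key: bytes, salt: bytes, fmt: str = "webp") -> str:
    """Return '/<signature>/<options>/<encoded source>.<format>' for one image."""
    # Unsigned part: processing options, then the Base64-encoded source URL.
    path = f"/rs:fit:{width}:{height}/{_b64url(source_url.encode('utf-8'))}.{fmt}"
    # Signature: HMAC-SHA256 keyed with `key` over salt + path.
    digest = hmac.new(key, salt + path.encode("ascii"), hashlib.sha256).digest()
    return f"/{_b64url(digest)}{path}"


if __name__ == "__main__":
    # Hypothetical key and salt; real deployments use long random values.
    print(imgproxy_path("https://example.com/a.jpg", 300, 200, b"secret-key", b"secret-salt"))
```

Signing means clients cannot request arbitrary sizes or source URLs, which matters when the originals are user uploads.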

U-2-Net

30 8,134 3.1 Python

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Project mention: I used the ChatGPT API to create a proof-of-concept AI driven video game. Using generative AI for the images and dialogue and GPT-3.5 for narrative and game control. More info in comments. | /r/ChatGPT | 2023-06-17

I use a finetuned custom Stable Diffusion model in combination with a style embedding for the characters for image generation and U²-Net for background removal.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Image processing related posts

Ask HN: How to handle user file uploads?
10 projects | news.ycombinator.com | 3 May 2024

Highlighting Image Text
1 project | dev.to | 30 Apr 2024

Vision AI agents for any task
1 project | dev.to | 30 Apr 2024

SVG Viewer – View, edit, and optimize SVGs
4 projects | news.ycombinator.com | 27 Apr 2024

Mango variety classification using Visual Geometry Group 16 (VGG16) in Python
1 project | dev.to | 16 Apr 2024

Jpegli: A New JPEG Coding Library
9 projects | news.ycombinator.com | 3 Apr 2024

Show HN: OS Image processing API running on edge functions using Rust and WASM
3 projects | news.ycombinator.com | 4 Apr 2024

Index

What are some of the best open-source Image processing projects? This list will help you:

#	Project	Stars
1	OpenCV	75,692
2	tesseract-ocr	58,182
3	sharp	27,987
4	EasyOCR	22,049
5	squoosh	20,981
6	CVPR2024-Papers-with-Code	16,227
7	filepond	14,677
8	rembg	14,628
9	supervision	14,068
10	albumentations	13,451
11	smartcrop.js	12,784
12	cropperjs	12,672
13	OCRmyPDF	12,067
14	pillow	11,722
15	LaTeX-OCR	10,860
16	caire	10,322
17	glide-transformations	9,854
18	Kornia	9,429
19	libvips	9,029
20	segmentation_models.pytorch	8,862
21	google-images-download	8,499
22	imgproxy	8,263
23	U-2-Net	8,134
