Image processing

Open-source projects categorized as Image processing

Top 23 Image processing Open-Source Projects

  • OpenCV

    Open Source Computer Vision Library

    Project mention: Opencv-Python adds support for Pathlike objects | news.ycombinator.com | 2024-01-23
  • tesseract-ocr

    Tesseract Open Source OCR Engine (main repository)

    Project mention: one of the Codia AI Design technologies: OCR Technology | dev.to | 2024-02-14

    You will also need to install the Tesseract OCR engine, which can be downloaded and installed from the following link: https://github.com/tesseract-ocr/tesseract

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • sharp

    High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.

    Project mention: Organize the mess of your photo folders with Node | dev.to | 2023-11-21

    sharp

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

    Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27

    PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]

  • squoosh

    Make images smaller using best-in-class codecs, right in the browser.

    Project mention: Optimizing Images for Developer Blogs | dev.to | 2024-02-20

    Squoosh: A webpage that allows you to quickly optimize images for your blog.

  • CVPR2023-Papers-with-Code

    CVPR 2023 论文和开源项目合集

  • filepond

    🌊 A flexible and fun JavaScript file upload library

    Project mention: Can anyone suggest PHP, JavaScript File Manager tool with Crop tool integrated? | /r/PHP | 2023-05-22

    Have a look at https://pqina.nl/filepond/

  • Onboard AI

    ChatGPT with full context of any GitHub repo. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at app.getonboardai.com.

  • rembg

    Rembg is a tool to remove images background

    Project mention: 🚀 Background Removal in Python with PyTorch and Rembg! 🎨🐍 | /r/programming | 2023-12-09

    A bit conflicted as the linked video is also linked from the actual rembg repo but it seems way faster and more detailed to just read the readme at that repo first, and maybe use a video if something doesn’t make sense.

  • albumentations

    Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

    Project mention: Augment specific classes? | /r/computervision | 2023-12-06

    You can use albumentations if you are comfortable with using open source libraries https://github.com/albumentations-team/albumentations

  • smartcrop.js

    Content aware image cropping

    Project mention: Just In Time Image Optimization at Reddit Scale | /r/RedditEng | 2023-06-28

    We chose to use govips which is a cgo wrapper around the libvips image manipulation library. The majority of new development for services in our backend is written using baseplate.go. But Go is not an ideal choice for media processing as it cannot keep up with the performance of native code. The most widely used image-processing libraries like libmagick are primarily written in C or C++. Speed was a major factor in selecting libvips in order to keep latency low on CDN cache misses for images. In our tests, libvips was 3–4 times faster than libmagick on basic image processing operations. Content-aware smart cropping was implemented by porting smartcrop.js to Go. This is the only operation implemented in pure Go.

  • cropperjs

    JavaScript image cropper.

    Project mention: How to Implement Partial Screenshare | dev.to | 2023-11-09

    We use the browser media devices API to bring up the screen selection dialog. After this, we handle the selection of a portion of interest using CropperJs.

  • pillow

    Python Imaging Library (Fork)

    Project mention: Github.com Is Down | news.ycombinator.com | 2023-06-29

    "down" could be a good name for a python image library plugin/extension.

    https://github.com/python-pillow/Pillow

  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

    Project mention: FLaNK Stack Weekly 19 Feb 2024 | dev.to | 2024-02-19
  • caire

    Content aware image resize library

  • LaTeX-OCR

    pix2tex: Using a ViT to convert images of equations into LaTeX code.

    Project mention: Detexify LaTeX Handwriting Symbol Recognition | news.ycombinator.com | 2023-11-14
  • glide-transformations

    An Android transformation library providing a variety of image transformations for Glide.

  • supervision

    We write your reusable computer vision tools. 💜

    Project mention: Ask HN: Who is hiring? (February 2024) | news.ycombinator.com | 2024-02-01

    Roboflow | Open Source Software Engineer, Web Designer / Developer, and more. | Full-time (Remote, SF, NYC) | https://roboflow.com/careers?ref=whoishiring0224

    Roboflow is the fastest way to use computer vision in production. We help developers give their software the sense of sight. Our end-to-end platform[1] provides tooling for image collection, annotation, dataset exploration and curation, training, and deployment.

    Over 250k engineers (including engineers from 2/3 Fortune 100 companies) build with Roboflow. We now host the largest collection of open source computer vision datasets and pre-trained models[2]. We are pushing forward the CV ecosystem with open source projects like Autodistill[3] and Supervision[4]. And we've built one of the most comprehensive resources for software engineers to learn to use computer vision with our popular blog[5] and YouTube channel[6].

    We have several openings available but are primarily looking for strong technical generalists who want to help us democratize computer vision and like to wear many hats and have an outsized impact. Our engineering culture is built on a foundation of autonomy & we don't consider an engineer fully ramped until they can "choose their own loss function". At Roboflow, engineers aren't just responsible for building things but also for helping us figure out what we should build next. We're builders & problem solvers; not just coders. (For this reason we also especially love hiring past and future founders.)

    We're currently hiring full-stack engineers for our ML and web platform teams, a web developer to bridge our product and marketing teams, several technical roles on the sales & field engineering teams, and our first applied machine learning researcher to help push forward the state of the art in computer vision.

    [1]: https://roboflow.com/?ref=whoishiring0224

    [2]: https://roboflow.com/universe?ref=whoishiring0224

    [3]: https://github.com/autodistill/autodistill

    [4]: https://github.com/roboflow/supervision

    [5]: https://blog.roboflow.com/?ref=whoishiring0224

    [6]: https://www.youtube.com/@Roboflow

  • Kornia

    Geometric Computer Vision Library for Spatial AI

  • libvips

    A fast image processing library with low memory needs.

    Project mention: Building an online image compressor | dev.to | 2024-01-09

    After some research, I found libvips, a demand-driven, horizontally threaded image processing library. It is designed to run quickly while using as little as memory as possible.

  • segmentation_models.pytorch

    Segmentation models with pretrained backbones. PyTorch.

    Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09

    Also, I’d suggest considering switching to the segmentation-models library - it provides U-Net models with a variety of pretrained backbones of as encoders. The author also put out a PyTorch version. https://github.com/qubvel/segmentation_models.pytorch https://github.com/qubvel/segmentation_models

  • google-images-download

    Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!

  • imgproxy

    Fast and secure standalone server for resizing and converting remote images

    Project mention: How Can I Streamline My Image Prep | /r/webdev | 2023-12-06

    We use imgproxy which does 2->4 and all you need to do is change the URL on your site.

  • U-2-Net

    The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

    Project mention: I used the ChatGPT API to create a proof-of-concept AI driven video game. Using generative AI for the images and dialogue and GPT-3.5 for narrative and game control. More info in comments. | /r/ChatGPT | 2023-06-17

    I use a finetuned custom Stable Diffusion model in combination with a style embedding for the characters for image generation and U²-Net for background removal.

  • WorkOS

    The modern API for authentication & user identity. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-20.

Image processing related posts

Index

What are some of the best open-source Image processing projects? This list will help you:

Project Stars
1 OpenCV 74,087
2 tesseract-ocr 56,504
3 sharp 27,441
4 EasyOCR 21,106
5 squoosh 20,468
6 CVPR2023-Papers-with-Code 14,543
7 filepond 14,469
8 rembg 13,225
9 albumentations 13,108
10 smartcrop.js 12,748
11 cropperjs 12,483
12 pillow 11,478
13 OCRmyPDF 11,334
14 caire 10,306
15 LaTeX-OCR 9,998
16 glide-transformations 9,832
17 supervision 9,594
18 Kornia 9,114
19 libvips 8,798
20 segmentation_models.pytorch 8,524
21 google-images-download 8,462
22 imgproxy 7,990
23 U-2-Net 7,875
The modern API for authentication & user identity.
The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
workos.com