Top 23 Python image-classification Projects

pytorch-image-models

35 29,751 9.4 Python

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Project mention: FLaNK AI Weekly 18 March 2024 | dev.to | 2024-03-18

ultralytics

27 22,624 9.8 Python

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Project mention: The CEO of Ultralytics (yolov8) using LLMs to engage with commenters on GitHub | news.ycombinator.com | 2024-02-12

Yep, I noticed this a while ago. It posts easily identifiable ChatGPT responses. It also posts garbage wrong answers which makes it worse than useless. Totally disrespectful to the userbase.
https://github.com/ultralytics/ultralytics/issues/5748#issue...

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
vit-pytorch

11 17,910 7.3 Python

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Project mention: Is it easier to go from Pytorch to TF and Keras than the other way around? | /r/pytorch | 2023-05-13

I also need to learn Pyspark so right now I am going to download the Fashion Mnist dataset, use Pyspark to downsize each image and put the into separate folders according to their labels (just to show employers I can do some basic ETL with Pyspark, not sure how I am going to load for training in Pytorch yet though). Then I am going to write the simplest Le Net to try to categorize the fashion MNIST dataset (results will most likely be bad but it's okay). Next, try to learn transfer learning in Pytorch for both CNN or maybe skip ahead to ViT. Ideally at this point I want to study the Attention mechanism a bit more and try to implement Simple Vit which I saw here: https://github.com/lucidrains/vit-pytorch/blob/main/vit_pytorch/simple_vit.py

albumentations

28 13,395 8.3 Python

Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Project mention: Augment specific classes? | /r/computervision | 2023-12-06

You can use albumentations if you are comfortable with using open source libraries https://github.com/albumentations-team/albumentations

Swin-Transformer

23 12,917 3.1 Python

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Project mention: Samsung expected to report 80% profit plunge as losses mount at chip business | news.ycombinator.com | 2023-10-10

> there is really nothing that "normal" AI requires that is bound to CUDA. pyTorch and Tensorflow are backend agnostic (ideally...).
There are a lot of optimizations that CUDA has that are nowhere near supported in other software or even hardware. Custom cuda kernels also aren't as rare as one might think, they will often just be hidden unless you're looking at libraries. Our more well known example is going to be StyleGAN[0] but it isn't uncommon to see elsewhere, even in research code. Swin even has a cuda kernel[1]. Or find torch here[1] (which github reports that 4% of the code is cuda (and 42% C++ and 2% C)). These things are everywhere. I don't think pytorch and tensorflow could ever be agnostic, there will always be a difference just because you have to spend resources differently (developing kernels is time resource). We can draw evidence by looking at Intel MKL, which is still better than open source libraries and has been so for a long time.
I really do want AMD to compete in this space. I'd even love a third player like Intel. We really do need competition here, but it would be naive to think that there's going to be a quick catchup here. AMD has a lot of work to do and posting a few bounties and starting a company (idk, called "micro grad"?) isn't going to solve the problem anytime soon.
And fwiw, I'm willing to bet that most AI companies would rather run in house servers than from cloud service providers. The truth is that right now just publishing is extremely correlated to compute infrastructure (doesn't need to be but with all the noise we've just said "fuck the poor" because rejecting is easy) and anyone building products has costly infrastructure.
[0] https://github.com/NVlabs/stylegan2-ada-pytorch/blob/d72cc7d...
[1] https://github.com/microsoft/Swin-Transformer/blob/2cb103f2d...
[2] https://github.com/pytorch/pytorch/tree/main/aten/src

pytorch-grad-cam

5 9,410 5.4 Python

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Project mention: Exploring GradCam and More with FiftyOne | dev.to | 2024-02-13

For the two examples we will be looking at, we will be using pytorch_grad_cam, an incredible open source package that makes working with GradCam very easy. There are excellent other tutorials to check out on the repo as well.

autogluon

8 7,091 9.6 Python

AutoGluon: Fast and Accurate ML in 3 Lines of Code
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
gluon-cv

1 5,751 1.8 Python

Gluon CV Toolkit
PaddleClas

2 5,251 5.6 Python

A treasure chest for visual classification and recognition powered by PaddlePaddle
hub

1 3,436 3.7 Python

A library for transfer learning by reusing parts of TensorFlow models. (by tensorflow)
catalyst

1 3,223 0.0 Python

Accelerated deep learning R&D (by catalyst-team)

Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09

mmpretrain

2 3,156 7.8 Python

OpenMMLab Pre-training Toolbox and Benchmark
efficientnet

9 2,057 0.0 Python

Implementation of EfficientNet model. Keras and TensorFlow Keras.

Project mention: Getting Started with Gemma Models | dev.to | 2024-04-15

Examples of lightweight models include MobileNet, a computer vision model designed for mobile and embedded vision applications, EfficientDet, an object detection model, and EfficientNet, a CNN that uses compound scaling to enable better performance. All these are lightweight models from Google.

sparseml

12 1,974 9.6 Python

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
ailia-models

4 1,814 9.8 Python

The collection of pre-trained, state-of-the-art AI models for ailia SDK
autodistill

13 1,520 9.2 Python

Images to inference with no labeling (use foundation models to train supervised models).

Project mention: Ask HN: Who is hiring? (February 2024) | news.ycombinator.com | 2024-02-01

Roboflow | Open Source Software Engineer, Web Designer / Developer, and more. | Full-time (Remote, SF, NYC) | https://roboflow.com/careers?ref=whoishiring0224
Roboflow is the fastest way to use computer vision in production. We help developers give their software the sense of sight. Our end-to-end platform[1] provides tooling for image collection, annotation, dataset exploration and curation, training, and deployment.
Over 250k engineers (including engineers from 2/3 Fortune 100 companies) build with Roboflow. We now host the largest collection of open source computer vision datasets and pre-trained models[2]. We are pushing forward the CV ecosystem with open source projects like Autodistill[3] and Supervision[4]. And we've built one of the most comprehensive resources for software engineers to learn to use computer vision with our popular blog[5] and YouTube channel[6].
We have several openings available but are primarily looking for strong technical generalists who want to help us democratize computer vision and like to wear many hats and have an outsized impact. Our engineering culture is built on a foundation of autonomy & we don't consider an engineer fully ramped until they can "choose their own loss function". At Roboflow, engineers aren't just responsible for building things but also for helping us figure out what we should build next. We're builders & problem solvers; not just coders. (For this reason we also especially love hiring past and future founders.)
We're currently hiring full-stack engineers for our ML and web platform teams, a web developer to bridge our product and marketing teams, several technical roles on the sales & field engineering teams, and our first applied machine learning researcher to help push forward the state of the art in computer vision.
[1]: https://roboflow.com/?ref=whoishiring0224
[2]: https://roboflow.com/universe?ref=whoishiring0224
[3]: https://github.com/autodistill/autodistill
[4]: https://github.com/roboflow/supervision
[5]: https://blog.roboflow.com/?ref=whoishiring0224
[6]: https://www.youtube.com/@Roboflow

pytorch-toolbelt

1 1,483 8.3 Python

PyTorch extensions for fast R&D prototyping and Kaggle farming
fastdup

18 1,403 9.4 Python

fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.

Project mention: Visualize your dataset using DINOv2 embedding | news.ycombinator.com | 2023-05-02

Visualizing your dataset (especially large ones) in a low-dimensional embedding space can tell you a lot about the patterns and clusters in your dataset.
We recently release a notebook showing how you can visualize your dataset using DINOv2 models by running it on your CPU.
Yes! No GPUs needed.
We used it to find clusters of similar images, duplicates, and outliers in a subset of the LAION dataset
Try it on your own dataset:
Colab notebook - https://colab.research.google.com/github/visual-layer/fastdup/blob/main/examples/dinov2_notebook.ipynb
GitHub repo - https://github.com/visual-layer/fastdup

involution

6 1,306 0.0 Python

[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
Unsupervised-Classification

2 1,306 1.4 Python

SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]
private-detector

5 1,252 4.6 Python

Bumble's Private Detector - a pretrained model for detecting lewd images

Project mention: I made a social media app | /r/webdev | 2023-12-08

Here’s one made by bumble

cleanvision

4 921 7.3 Python

Automatically find issues in image datasets and practice data-centric computer vision.
SimMIM

1 860 0.0 Python

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python image-classification related posts

I made a social media app
1 project | /r/webdev | 8 Dec 2023
Samsung expected to report 80% profit plunge as losses mount at chip business
3 projects | news.ycombinator.com | 10 Oct 2023
Is it easier to go from Pytorch to TF and Keras than the other way around?
1 project | /r/pytorch | 13 May 2023
Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows
2 projects | news.ycombinator.com | 9 Apr 2023
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
1 project | /r/neuralnetworks | 9 Apr 2023
Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows
1 project | /r/hypeurls | 9 Apr 2023
PyTorch at the Edge: Deploy 964 TIMM Models on Android with TorchScript
2 projects | news.ycombinator.com | 14 Feb 2023
A note from our sponsor - InfluxDB
www.influxdata.com | 25 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source image-classification projects in Python? This list will help you:

	Project	Stars
1	pytorch-image-models	29,751
2	ultralytics	22,624
3	vit-pytorch	17,910
4	albumentations	13,395
5	Swin-Transformer	12,917
6	pytorch-grad-cam	9,410
7	autogluon	7,091
8	gluon-cv	5,751
9	PaddleClas	5,251
10	hub	3,436
11	catalyst	3,223
12	mmpretrain	3,156
13	efficientnet	2,057
14	sparseml	1,974
15	ailia-models	1,814
16	autodistill	1,520
17	pytorch-toolbelt	1,483
18	fastdup	1,403
19	involution	1,306
20	Unsupervised-Classification	1,306
21	private-detector	1,252
22	cleanvision	921
23	SimMIM	860