SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python image-classification Projects
-
Project mention: Show HN: Using YOLO to Detect Office Chairs in 40M Hotel Photos | news.ycombinator.com | 2025-01-25
They did it on their own computer. https://github.com/ultralytics/ultralytics
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Project mention: This PR content was generated automatically using cover-agent | news.ycombinator.com | 2024-11-19Those are some pointless tests.
E.g. test_activation_stats_functions [1] that just checks that the returned value is a float, and that it can take random numbers as input.
test_get_state_dict_custom_unwrap [2] is probably supposed to check that custom_unwrap is invoked, but since it doesn't either record being called, or transform its input, the assertions can't actually check that it was called.
[1] https://github.com/huggingface/pytorch-image-models/pull/233...
[2] https://github.com/huggingface/pytorch-image-models/pull/233...
-
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
-
albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Project mention: Albumentations: Fast and flexible image augmentation library | news.ycombinator.com | 2025-02-22 -
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
-
pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
-
Project mention: Launch HN: Enhanced Radar (YC W25) – A safety net for air traffic control | news.ycombinator.com | 2025-03-04
Are there already bird not a bird datasets?
Procedures for creating "bird on Multispectral plane radar and video" dataset(s):
Tag birds on the dashcam video with timecoded sensor data and a segmentation and annotation tool.
Pinch to zoom, auto-edge detect, classification probability, sensor status
voxel51/fiftyone does segmentation and annotation with video and possibly Multispectral data: https://github.com/voxel51/fiftyone
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Project mention: InternVL3: Unified Multimodal AI Training Outperforms Open-Source Rivals | dev.to | 2025-04-17InternVL3 marks a significant advancement in the InternVL model series, implementing a native multimodal pre-training approach that fundamentally transforms how vision-language models learn. Unlike most leading multimodal large language models (MLLMs) that adapt text-only models to handle visual inputs through complex post-hoc alignment, InternVL3 jointly acquires multimodal and linguistic capabilities in a single unified pre-training stage.
-
-
-
-
-
-
autodistill
Images to inference with no labeling (use foundation models to train supervised models).
-
-
sparseml
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
-
-
fastdup
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.
-
-
Unsupervised-Classification
SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]
-
-
-
involution
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python image-classification discussion
Python image-classification related posts
-
Show HN: Local, automatic, image keywords, captions using metadata for storage
-
Alternatives to Cosine Similarity
-
I made a social media app
-
Samsung expected to report 80% profit plunge as losses mount at chip business
-
Is it easier to go from Pytorch to TF and Keras than the other way around?
-
Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows
-
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
-
A note from our sponsor - SaaSHub
www.saashub.com | 12 May 2025
Index
What are some of the best open-source image-classification projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | ultralytics | 40,251 |
2 | pytorch-image-models | 34,064 |
3 | vit-pytorch | 22,795 |
4 | albumentations | 14,908 |
5 | Swin-Transformer | 14,746 |
6 | pytorch-grad-cam | 11,612 |
7 | fiftyone | 9,461 |
8 | InternVL | 8,015 |
9 | gluon-cv | 5,882 |
10 | PaddleClas | 5,644 |
11 | mmpretrain | 3,649 |
12 | hub | 3,496 |
13 | catalyst | 3,336 |
14 | autodistill | 2,252 |
15 | ailia-models | 2,191 |
16 | sparseml | 2,131 |
17 | efficientnet | 2,083 |
18 | fastdup | 1,680 |
19 | pytorch-toolbelt | 1,541 |
20 | Unsupervised-Classification | 1,418 |
21 | private-detector | 1,330 |
22 | poolformer | 1,326 |
23 | involution | 1,307 |