examples
pytorch-image-models
Our great sponsors
examples | pytorch-image-models | |
---|---|---|
5 | 35 | |
376 | 29,751 | |
10.9% | 2.9% | |
6.8 | 9.4 | |
3 months ago | 5 days ago | |
Jupyter Notebook | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
examples
- FLaNK Stack Weekly for 07August2023
-
Vector database built for scalable similarity search
As another commenter noted, Milvus is overkill and a "bit much" if you're learning/playing.
A good intro to the field with progression towards a full Milvus implementation could be starting with towhee[0] (which is also supported by Milvus).
towhee has an example to do exactly what you want with CLIP[1].
[0] - https://towhee.io/
[1] - https://github.com/towhee-io/examples/tree/main/image/text_i...
-
Ask HN: Any good self-hosted image recognition software?
Usually this is done in three steps. The first step is using a neural network to create a bounding box around the object, then generating vector embeddings of the object, and then using similarity search on vector embeddings.
The first step is accomplished by training a detection model to generate the bounding box around your object, this can usually be done by finetuning an already trained detection model. For this step the data you would need is all the images of the object you have with a bounding box created around it, the version of the object doesnt matter here.
The second step involves using a generalized image classification model thats been pretrained on generalized data (VGG, etc.) and a vector search engine/vector database. You would start by using the image classification model to generate vector embeddings (https://frankzliu.com/blog/understanding-neural-network-embe...) of all the different versions of the object. The more ground truth images you have, the better, but it doesn't require the same amount as training a classifier model. Once you have your versions of the object as embeddings, you would store them in a vector database (for example Milvus: https://github.com/milvus-io/milvus).
Now whenever you want to detect the object in an image you can run the image through the detection model to find the object in the image, then run the sliced out image of the object through the vector embedding model. With this vector embedding you can then perform a search in the vector database, and the closest results will most likely be the version of the object.
Hopefully this helps with the general rundown of how it would look like. Here is an example using Milvus and Towhee https://github.com/towhee-io/examples/tree/3a2207d67b10a246f....
Disclaimer: I am a part of those two open source projects.
-
Deep Dive into Real-World Image Search Engine with Python
I have shown how to Build an Image Search Engine in Minutes in the previous tutorial. Here is another one for how to optimize the algorithm, feed it with large-scale image datasets, and deploy it as a micro-service.
-
Build an Image Search Engine in Minutes
The full tutorial is at https://github.com/towhee-io/examples/blob/main/image/reverse_image_search/build_image_search_engine.ipynb
pytorch-image-models
- FLaNK AI Weekly 18 March 2024
-
[D] Hugging face and Timm
I am a PyTorch user I work in CV, I usually use the PyTorch models. However, I see people use timm in research papers to train their models I don't understand what it is timm is it a new framework like PyTorch? Further, when I click https://pypi.org/project/timm/ homepage it takes me to hugging face GitHub https://github.com/huggingface/pytorch-image-models is there any connection between timm and hugging face many of my friends use hugging face but I also don't know about hugging face I use simple PyTorch and torchvision.models.
-
FLaNK Stack Weekly for 07August2023
https://github.com/huggingface/pytorch-image-models https://huggingface.co/docs/timm/index
-
[R] Nvidia RTX 4090 ML benchmarks. Under QEMU/KVM. Image + Transformers. FP16/FP32.
pytorch-image-models
-
Inference on resent, cant work out the problem?
additionally, you might find the timm library handy for this sort of work.
-
Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows
This is still being pursued. Ross Wightmann's timm[0,1] package (now on Hugging Face) has done a lot of this. There's also a V2 of ConvNext[2]. Ross does write about this a lot on Twitter fwiw. I should also mention that there are still many transformer based networks that still beat convs. So there probably won't be a resurgence in convs until someone can show that there's a really strong reason for them. They have some advantages but they also might not be flexible enough for the long range tasks in segmentation and detection. But maybe they are.
FAIR definitely did great work with ConvNext, and I do hope to see more. There always needs to be people pushing unpopular paradigms.
[0] https://github.com/huggingface/pytorch-image-models
[1] https://arxiv.org/abs/2110.00476
[2] https://arxiv.org/abs/2301.00808
-
Problems with Learning Rate Finder in Pytorch Lightning
I am doing Binary classification with a pre-trained EfficientNet tf_efficientnet_l2. I froze all weights during training and replaced the classifier with a custom trainable one that looks like:
-
PyTorch at the Edge: Deploying Over 964 TIMM Models on Android with TorchScript and Flutter
In this post, Iām going to show you how you can pick from over 900+ SOTA models on TIMM, train them using best practices with Fastai, and deploy them on Android using Flutter.
-
ImageNet Advise
The other thing is, try to find tricks to speed up your experiments (if not having done so already). The most obvious are mixed precision training, have your model train on a lower resolution input first and then increase the resolution later in the training, stochastic depth, and a bunch more stuffs. Look for implementations in https://github.com/rwightman/pytorch-image-models .
- Doubt about transformers
What are some alternatives?
towhee - Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
yolov5 - YOLOv5 š in PyTorch > ONNX > CoreML > TFLite
milvus-lite - A lightweight version of Milvus wrapped with Python.
mmdetection - OpenMMLab Detection Toolbox and Benchmark
gorilla-cli - LLMs for your CLI
detectron2 - Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
anomalib - An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
mmcv - OpenMMLab Computer Vision Foundation
EverythingApacheNiFi - EverythingApacheNiFi
segmentation_models.pytorch - Segmentation models with pretrained backbones. PyTorch.
harlequin - The SQL IDE for Your Terminal.
yolact - A simple, fully convolutional model for real-time instance segmentation.