dino
pytorch-metric-learning
Our great sponsors
dino | pytorch-metric-learning | |
---|---|---|
7 | 3 | |
5,854 | 5,764 | |
3.4% | - | |
1.0 | 7.9 | |
20 days ago | 30 days ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dino
- Batch-wise processing or image-by-image processing? (DINO V1)
-
[P] Image search with localization and open-vocabulary reranking.
I also implemented one based on the self attention maps from the DINO trained ViT’s. This worked pretty well when the attention maps were combined with some traditional computer vision to get bounding boxes. It seemed an ok compromise between domain specialization and location specificity. I did not try any saliency or gradient based methods as i was not sure on generalization and speed respectively. I know LAVIS has an implementation of grad cam and it seems to work well in the plug'n'play vqa.
-
Unsupervised semantic segmentation
You will probably need an unwieldy amount of data and compute to reproduce it, so your best option would be to use the pretrained models available on github.
-
[D] Why Transformers are taking over the Compute Vision world: Self-Supervised Vision Transformers with DINO explained in 7 minutes!
[Full Explanation Post] [Arxiv] [Project Page]
-
A major part of real-world AI has to be solved to make unsupervised, generalized full self-driving work, as the entire road system is designed for biological neural nets with optical imagers
Except he is actually talking about the new DINO model created by facebook that was released on friday. Which is a new approach to image transformers for unsupervised segmentation. Here's its github.
-
[D] Paper Explained - DINO: Emerging Properties in Self-Supervised Vision Transformers (Full Video Analysis)
Code: https://github.com/facebookresearch/dino
- [R] DINO and PAWS: Advancing the state of the art in computer vision with self-supervised Transformers
pytorch-metric-learning
-
Similarity Learning lacks a framework. So we built one
Not a full featured framework, but pytorch-metric-learning has data loaders, lossess, etc. to facilitate similarity learning: https://github.com/KevinMusgrave/pytorch-metric-learning
Disclaimer: I've made some contributions to it.
-
[R][D] VAE Embedding Space - Can we force it to learn a metric?
You can use the triplet loss together with the Gaussian prior. It will be zero centered though and the clusters are not as separated when you use the triplet loss only.There are many alternative to the triplet loss, in case it needs to be a metric: https://github.com/KevinMusgrave/pytorch-metric-learning
-
[D] Similar Image Retrieval
This repo provides the tools and examples needed to build such a model: https://github.com/KevinMusgrave/pytorch-metric-learning
What are some alternatives?
simsiam-cifar10 - Code to train the SimSiam model on cifar10 using PyTorch
lightly - A python library for self-supervised learning on images.
Transformer-SSL - This is an official implementation for "Self-Supervised Learning with Swin Transformers".
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
pytorch-lightning - Build high-performance AI models with PyTorch Lightning (organized PyTorch). Deploy models with Lightning Apps (organized Python to build end-to-end ML systems). [Moved to: https://github.com/Lightning-AI/lightning]
byol-pytorch - Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch
unsupervised-depth-completion-visual-inertial-odometry - Tensorflow and PyTorch implementation of Unsupervised Depth Completion from Visual Inertial Odometry (in RA-L January 2020 & ICRA 2020)
autogluon - Fast and Accurate ML in 3 Lines of Code
solo-learn - solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
similarity - TensorFlow Similarity is a python package focused on making similarity learning quick and easy.