vit-explain
how-do-vits-work
Metric | vit-explain | how-do-vits-work
---|---|---
Mentions | 2 | 3
Stars | 708 | 784
Growth | - | -
Activity | 0.0 | 0.0
Last commit | about 2 years ago | almost 2 years ago
Language | Python | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vit-explain
- [D] Off-the-shelf image saliency scoring models?
- Explainability for Vision Transformers TF Implementation
  I'm trying to implement this code for visualization/explainability of ViTs in TensorFlow, but I'm having trouble finding a TF equivalent of PyTorch's Module.register_forward_hook. Any ideas on how I can do it?
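For readers unfamiliar with the hook the post refers to, here is a minimal sketch of PyTorch's `register_forward_hook` capturing an intermediate activation. The toy `nn.Sequential` model, the `captured` dictionary, and the `save_output` function are illustrative assumptions, not code from vit-explain.

```python
import torch
import torch.nn as nn

# Toy stand-in for a ViT block (assumption); hooks work the same on any nn.Module.
model = nn.Sequential(
    nn.Linear(8, 16),
    nn.ReLU(),
    nn.Linear(16, 4),
)

captured = {}  # hypothetical store for intermediate activations


def save_output(module, inputs, output):
    # Runs right after module.forward(); `inputs` is a tuple of positional args.
    captured["relu"] = output.detach()


# Attach the hook to the ReLU layer and keep the handle so it can be removed.
handle = model[1].register_forward_hook(save_output)

with torch.no_grad():
    logits = model(torch.randn(2, 8))

handle.remove()  # detach the hook once the activation has been captured
```

In TensorFlow/Keras, one commonly used workaround (not necessarily what the poster settled on) is to build a second `tf.keras.Model` that shares the original model's inputs and returns the intermediate layers' outputs.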
how-do-vits-work
- A New Deep Learning Study Investigate and Clarify the Intrinsic Behavior of Transformers in Computer Vision
  GitHub: https://github.com/xxxnell/how-do-vits-work
- [D] Paper Explained – How Do Vision Transformers Work?
  Code for https://arxiv.org/abs/2202.06709 found: https://github.com/xxxnell/how-do-vits-work
- How Do Vision Transformers Work?
What are some alternatives?
captum - Model interpretability and understanding for PyTorch
Parallel-Tacotron2 - PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
mmdetection - OpenMMLab Detection Toolbox and Benchmark
awesome-fast-attention - list of efficient attention modules
pytorch-grad-cam - Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
MPViT - [CVPR 2022] MPViT: Multi-Path Vision Transformer for Dense Prediction
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
OpenPrompt - An Open-Source Framework for Prompt-Learning.
query-selector - Long-Term Series Forecasting with Query Selector – Efficient Model of Sparse Attention
LaTeX-OCR - pix2tex: Using a ViT to convert images of equations into LaTeX code.
attention_to_gif - Visualize transition of attention weights across layers in a Transformer as a GIF