Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Why do you think https://github.com/lucidrains/memory-efficient-attention-pytorch is a good alternative to vit-pytorch?