PaddleViT
PyTorch-Vision-Transformer-ViT-MNIST-CIFAR10
Our great sponsors
PaddleViT | PyTorch-Vision-Transformer-ViT-MNIST-CIFAR10 | |
---|---|---|
2 | 2 | |
1,169 | 59 | |
- | - | |
0.0 | 8.8 | |
over 1 year ago | 12 days ago | |
Python | Python | |
Apache License 2.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
PaddleViT
-
Researchers From China Propose A Pale-Shaped Self-Attention (PS-Attention) And A General Vision Transformer Backbone, Called Pale Transformer
Code for https://arxiv.org/abs/2112.14000 found: https://github.com/BR-IDL/PaddleViT
Github: https://github.com/BR-IDL/PaddleViT
PyTorch-Vision-Transformer-ViT-MNIST-CIFAR10
-
Scratch Implementation of Vision Transformer in PyTorch
In the encoder class, ViTs use prenorm not post-norm like BERT. That is the first norm layer should be before the attention and the second norm layer should be before "self.fc1" https://github.com/s-chh/PyTorch-Vision-Transformer-ViT-MNIST/blob/main/model.py
What are some alternatives?
FastestDet - :zap: A newly designed ultra lightweight anchor free target detection algorithm, weight only 250K parameters, reduces the time consumption by 10% compared with yolo-fastest, and the post-processing is simpler
SwinIR - SwinIR: Image Restoration Using Swin Transformer (official repository)
medicaldetectiontoolkit - The Medical Detection Toolkit contains 2D + 3D implementations of prevalent object detectors such as Mask R-CNN, Retina Net, Retina U-Net, as well as a training and inference framework focused on dealing with medical images.
towhee - Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
mmrazor - OpenMMLab Model Compression Toolbox and Benchmark.
Efficient-AI-Backbones - Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
SINet - Camouflaged Object Detection, CVPR 2020 (Oral)
mmdetection - OpenMMLab Detection Toolbox and Benchmark
ttach - Image Test Time Augmentation with PyTorch!
LaTeX-OCR - pix2tex: Using a ViT to convert images of equations into LaTeX code.
ml-cvnets - CVNets: A library for training computer vision networks
sigmarsgarden - Opus Magnum's Sigmar's Garden Autosolver, using OpenCV Template Matching