Fast-Transformer VS Transformer-in-Transformer

Compare Fast-Transformer vs Transformer-in-Transformer and see what their differences are.


An implementation of Fastformer: Additive Attention Can Be All You Need, a Transformer Variant in TensorFlow (by Rishit-dagli)
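To give a feel for the mechanism the repo implements, below is a minimal, self-contained sketch of Fastformer-style additive attention in TensorFlow. It is illustrative only and does not reproduce the repository's API; the function name, weight shapes, and toy dimensions are assumptions.

```python
import tensorflow as tf

def additive_attention(x, wq, wk, wv, wr):
    """Minimal single-head sketch of Fastformer-style additive attention.

    x: (batch, seq, dim) token representations.
    wq, wk, wv, wr: (dim, dim) projection matrices (illustrative, not the repo's API).
    """
    q = tf.einsum('bnd,de->bne', x, wq)   # queries
    k = tf.einsum('bnd,de->bne', x, wk)   # keys
    v = tf.einsum('bnd,de->bne', x, wv)   # values

    scale = tf.sqrt(tf.cast(tf.shape(q)[-1], q.dtype))

    # Global query: softmax over per-token scalar scores, then a weighted sum.
    alpha = tf.nn.softmax(tf.reduce_sum(q, axis=-1) / scale, axis=1)          # (b, n)
    global_q = tf.reduce_sum(alpha[..., None] * q, axis=1, keepdims=True)     # (b, 1, d)

    # Element-wise interaction between the global query and every key.
    p = global_q * k                                                          # (b, n, d)

    # Global key: the same additive pooling applied to the interactions.
    beta = tf.nn.softmax(tf.reduce_sum(p, axis=-1) / scale, axis=1)           # (b, n)
    global_k = tf.reduce_sum(beta[..., None] * p, axis=1, keepdims=True)      # (b, 1, d)

    # Mix the global key with the values, transform, and add the query residual.
    u = global_k * v
    return tf.einsum('bnd,de->bne', u, wr) + q

# Toy usage with random weights; all sizes here are assumptions for illustration.
x = tf.random.normal((2, 8, 16))
wq, wk, wv, wr = (tf.random.normal((16, 16)) for _ in range(4))
out = additive_attention(x, wq, wk, wv, wr)   # shape (2, 8, 16)
```

Because the sequence is pooled into single global query and key vectors, every step above is linear in sequence length, which is the efficiency argument of the Fastformer paper.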


An implementation of Transformer in Transformer in TensorFlow for image classification, with attention inside local patches (by Rishit-dagli)
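For intuition about the "attention inside local patches" idea, here is a minimal conceptual sketch of one Transformer-in-Transformer step built from stock Keras layers: inner attention over the pixel tokens of each patch, a linear fold back into the patch tokens, and outer attention across patches. The dimensions and layer choices are assumptions for illustration and do not mirror the repository's API.

```python
import tensorflow as tf
from tensorflow.keras import layers

# Illustrative sizes (assumptions, not the repo's defaults).
batch, patches, pixels_per_patch = 2, 16, 16
pixel_dim, patch_dim, heads = 24, 256, 4

inner_attn = layers.MultiHeadAttention(num_heads=heads, key_dim=pixel_dim)
outer_attn = layers.MultiHeadAttention(num_heads=heads, key_dim=patch_dim // heads)
pixel_to_patch = layers.Dense(patch_dim)       # folds pixel tokens back into patch tokens

pixel_tokens = tf.random.normal((batch, patches, pixels_per_patch, pixel_dim))
patch_tokens = tf.random.normal((batch, patches, patch_dim))

# Inner transformer: self-attention among the pixel tokens of each patch.
inner = tf.reshape(pixel_tokens, (batch * patches, pixels_per_patch, pixel_dim))
inner = inner + inner_attn(inner, inner)

# Fold the refined pixel tokens into their patch token via a linear projection.
folded = tf.reshape(inner, (batch, patches, pixels_per_patch * pixel_dim))
patch_tokens = patch_tokens + pixel_to_patch(folded)

# Outer transformer: self-attention among the patch tokens of the whole image.
patch_tokens = patch_tokens + outer_attn(patch_tokens, patch_tokens)

print(patch_tokens.shape)   # (2, 16, 256)
```

Stacking several such blocks and classifying from the patch tokens gives the overall TNT architecture described in the paper.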
              Fast-Transformer      Transformer-in-Transformer
Mentions      4                     4
Stars         146                   40
Growth        -                     -
Activity      3.2                   0.0
Last commit   almost 2 years ago    almost 2 years ago
Language      Jupyter Notebook      Jupyter Notebook
License       Apache License 2.0    Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.


Posts with mentions or reviews of Fast-Transformer. We have used some of these posts to build our list of alternatives and similar projects.

We haven't tracked posts mentioning Fast-Transformer yet.
Tracking mentions began in Dec 2020.


Posts with mentions or reviews of Transformer-in-Transformer. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-12-06.

What are some alternatives?

When comparing Fast-Transformer and Transformer-in-Transformer you can also consider the following projects:

reformer-pytorch - Reformer, the efficient Transformer, in Pytorch

Perceiver - Implementation of Perceiver, General Perception with Iterative Attention in TensorFlow

poolformer - PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)

Conformer - An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras

LongNet - Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

AvatarGAN - Generate Cartoon Images using Generative Adversarial Network

TimeSformer-pytorch - Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

machine-learning-experiments - 🤖 Interactive Machine Learning experiments: 🏋️models training + 🎨models demo

swarms - Build, Deploy, and Scale Reliable Swarms of Autonomous Agents.

embedding-encoder - Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.

ML-Workspace - 🛠 All-in-one web-based IDE specialized for machine learning and data science.

planckforth - Bootstrapping a Forth interpreter from hand-written tiny ELF binary. Just for fun.