Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf
Why do you think https://github.com/Deci-AI/super-gradients is a good alternative to nested-transformer?