⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
Why do you think that https://github.com/NVIDIA/FasterTransformer is a good alternative to fastT5
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
Why do you think that https://github.com/NVIDIA/FasterTransformer is a good alternative to fastT5