Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)
Why do you think that https://github.com/huggingface/transformers is a good alternative to sru
Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)
Why do you think that https://github.com/huggingface/transformers is a good alternative to sru