This repository contains Tensorflow 2 code for Attention Mechanisms chapter of Dive into Deep Learning (D2L) book.
Why do you think that https://github.com/Rishit-dagli/Transformer-in-Transformer is a good alternative to D2L_Attention_Mechanisms_in_TF