Write code to train a GPT-2 model on the TinyStories dataset, following the TinyStories paper.
Why do you think that https://github.com/worldbank/REaLTabFormer is a good alternative to TinyStories?