Basic deepspeed_testing repo stats
8 days ago
Similar projects and alternatives to deepspeed_testing
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
NOTE: The number of mentions on this list indicates mentions on common posts. Hence, a higher number means a better deepspeed_testing alternative or higher similarity.
Posts where deepspeed_testing has been mentioned. We have used some of these posts to build our list of alternatives and similar projects - the last one was on 2021-05-03.
DeepSpeed Investigation: What I Learned
dev.to | 2021-05-03
To test out DeepSpeed, I used the awesome HuggingFace transformers library, which supports using DeepSpeed on their non-stable branch (though support is coming to the stable branch in 4.6 🤓). I followed these awesome instructions on the HuggingFace’s website for getting started with DeepSpeed and HuggingFace. If you want to follow along at home, I created a Github repository with the Dockerfile (I’m addicted to docker and will probably make a blog post on docker too :)) and the test script I used to run my experiments on. I tried training the different versions of the awesome T5 model that ranged from smallish ~60 million parameters to humungous 3 billion parameters. And here are my results: