PyTorch extensions for high performance and large scale training.
Why do you think that https://github.com/bigscience-workshop/Megatron-DeepSpeed is a good alternative to fairscale?