An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Why do you think that https://github.com/photopea/photopea is a good alternative to gpt-neox?