An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Why do you think that https://github.com/EleutherAI/DALLE-mtf is a good alternative to gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Why do you think that https://github.com/EleutherAI/DALLE-mtf is a good alternative to gpt-neo