Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Why do you think that https://github.com/facebookresearch/dadaptation is a good alternative to Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Why do you think that https://github.com/facebookresearch/dadaptation is a good alternative to Adan