Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Why do you think that https://github.com/lucidrains/Adan-pytorch is a good alternative to Adan?