D-Adaptation for SGD, Adam and AdaGrad
Why do you think that https://github.com/google-research/tuning_playbook is a good alternative to dadaptation
D-Adaptation for SGD, Adam and AdaGrad
Why do you think that https://github.com/google-research/tuning_playbook is a good alternative to dadaptation