Quasi Hyperbolic Rectified DEMON Adam/Amsgrad with AdaMod, Gradient Centralization, Lookahead, iterative averaging and decorrelated Weight Decay
Why do you think that https://github.com/Rishit-dagli/Gradient-Centralization-TensorFlow is a good alternative to DemonRangerOptimizer