Adaptive Gradient Descent for Convex and Non-Convex Stochastic Optimization

19 Nov 2019  ·  Aleksandr Ogaltsov, Darina Dvinskikh, Pavel Dvurechensky, Alexander Gasnikov, Vladimir Spokoiny ·

In this paper we propose several adaptive gradient methods for stochastic optimization. Our methods are based on Armijo-type line search and they simultaniously adapt to the unknown Lipschitz constant of the gradient and variance of the stochastic approximation for the gradient. We consider an accelerated gradient descent for convex problems and gradient descent for non-convex problems. In the experiments we demonstrate superiority of our methods to existing adaptive methods, e.g. AdaGrad and Adam.

PDF Abstract