no code implementations • 29 Sep 2021 • Takashi Mori, Liu Ziyin, Kangqiao Liu, Masahito Ueda
Stochastic gradient descent (SGD) is subject to complicated multiplicative noise when the loss is the mean-square error.
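The claim can be seen in a toy example. The NumPy sketch below (an illustration under assumed data, not code from the paper) fits a 1D linear model with the mean-square loss and shows that the variance of a minibatch gradient grows with the distance from the minimum, i.e. the SGD noise is state-dependent (multiplicative) rather than additive; the dataset, batch size, and helper `minibatch_grad` are all invented for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
N = 10_000
x = rng.normal(size=N)
y = 2.0 * x + 0.1 * rng.normal(size=N)  # toy 1D regression, true slope 2

def minibatch_grad(w, batch_size=32):
    """One stochastic gradient of the mean-square loss L(w) = mean((w*x - y)**2) / 2."""
    idx = rng.integers(N, size=batch_size)
    return np.mean((w * x[idx] - y[idx]) * x[idx])

# The variance of the minibatch gradient depends on where w sits:
# far from the minimum the residuals are large and so is the noise.
for w in (2.0, 0.0, 10.0):
    grads = np.array([minibatch_grad(w) for _ in range(5_000)])
    print(f"w = {w:5.1f}:  Var[g] = {grads.var():.4f}")
```

Running it shows Var[g] is smallest near the minimizer w = 2 and grows as w moves away, which is the signature of multiplicative noise.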
no code implementations • ICLR 2022 • Liu Ziyin, Kangqiao Liu, Takashi Mori, Masahito Ueda
The noise in stochastic gradient descent (SGD), caused by minibatch sampling, is poorly understood despite its practical importance in deep learning.
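As a hedged illustration of the minibatch-sampling noise the abstract refers to, the sketch below empirically estimates the covariance of minibatch gradients for an assumed two-parameter linear model; it is a generic construction rather than the paper's method, and every constant in it is illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)
N, d, B = 5_000, 2, 16
X = rng.normal(size=(N, d))
y = X @ np.array([1.0, -1.0]) + 0.1 * rng.normal(size=N)
w = np.zeros(d)  # probe the noise away from the minimum

full_grad = X.T @ (X @ w - y) / N

# Draw many minibatch gradients and estimate the noise covariance
# C = Cov[g_batch]; its structure (anisotropy, state dependence) is
# what theories of SGD noise try to characterise.
G = np.array([
    X[idx].T @ (X[idx] @ w - y[idx]) / B
    for idx in (rng.integers(N, size=B) for _ in range(20_000))
])
print("full-batch gradient:    ", full_grad)
print("mean minibatch gradient:", G.mean(axis=0))  # unbiased estimate
print("noise covariance C:\n", np.cov(G.T))
```

The minibatch gradient is unbiased, so all of the interesting behavior sits in the covariance C, which here is neither isotropic nor independent of w.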
no code implementations • 7 Dec 2020 • Kangqiao Liu, Liu Ziyin, Masahito Ueda
In the vanishing-learning-rate regime, stochastic gradient descent (SGD) is now relatively well understood.
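A minimal sketch of what that regime looks like, under assumed toy data (not the paper's setup): holding the continuous time T = eta * steps fixed while shrinking eta, SGD tracks the deterministic gradient flow and its run-to-run fluctuations shrink with eta.

```python
import numpy as np

rng = np.random.default_rng(2)
N = 2_000
x = rng.normal(size=N)
y = 3.0 * x + 0.2 * rng.normal(size=N)  # toy problem, minimizer near w = 3

def run_sgd(eta, steps, w0=0.0, batch_size=8):
    """Plain SGD on the 1D mean-square loss."""
    w = w0
    for _ in range(steps):
        idx = rng.integers(N, size=batch_size)
        w -= eta * np.mean((w * x[idx] - y[idx]) * x[idx])
    return w

# Hold the continuous time T = eta * steps fixed while eta -> 0:
# the final iterate concentrates on the gradient-flow solution.
T = 5.0
for eta in (0.1, 0.01, 0.001):
    finals = [run_sgd(eta, int(T / eta)) for _ in range(20)]
    print(f"eta = {eta:6.3f}:  mean w = {np.mean(finals):.4f}, std = {np.std(finals):.2e}")
```

At fixed T the mean final iterate is essentially independent of eta, while the spread across runs shrinks as eta decreases, which is why the finite-learning-rate case studied here requires a separate analysis.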