Search Results for author: Victor Luo

Found 2 papers, 0 papers with code

SGD Distributional Dynamics of Three Layer Neural Networks

no code implementations30 Dec 2020 Victor Luo, Yazhen Wang, Glenn Fung

In this paper, we seek to extend the mean field results of Mei et al. (2018) from two-layer neural networks with one hidden layer to three-layer neural networks with two hidden layers.

How Many Factors Influence Minima in SGD?

no code implementations24 Sep 2020 Victor Luo, Yazhen Wang

The influencing factors identified in the literature include learning rate, batch size, Hessian, and gradient covariance, and stochastic differential equations are used to model SGD and establish the relationships among these factors for characterizing minima found by SGD.

Cannot find the paper you are looking for? You can Submit a new open access paper.