Search Results for author: Sam S. Schoenholz

Found 1 papers, 0 papers with code

Deep Mean Field Theory: Layerwise Variance and Width Variation as Methods to Control Gradient Explosion

no code implementations • ICLR 2018 • Greg Yang, Sam S. Schoenholz

Using the obtained mean field theory, we are able to track surprisingly well how VV at initialization time affects training and test time performance on MNIST after a set number of epochs: the level sets of test/train set accuracies coincide with the level sets of the expectations of certain gradient norms or of metric expressivity (as defined in \cite{yang_meanfield_2017}), a measure of expansion in a random neural network.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.