Search Results for author: Chang-Han Rhee

Found 1 papers, 0 papers with code

Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise

no code implementations ICLR 2022 Xingyu Wang, Sewoong Oh, Chang-Han Rhee

The empirical success of deep learning is often attributed to SGD's mysterious ability to avoid sharp local minima in the loss landscape, as sharp minima are known to lead to poor generalization.

Cannot find the paper you are looking for? You can Submit a new open access paper.