1 code implementation • 29 Nov 2023 • Sungbin Shin, Dongyeop Lee, Maksym Andriushchenko, Namhoon Lee
Training an overparameterized neural network can yield minimizers of different generalization capabilities despite the same level of training loss.