Search Results for author: Pascal Jr. Tikeng Notsawo

Found 1 papers, 0 papers with code

Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok

no code implementations23 Jun 2023 Pascal Jr. Tikeng Notsawo, Hattie Zhou, Mohammad Pezeshki, Irina Rish, Guillaume Dumas

In essence, by studying the learning curve of the first few epochs, we show that one can predict whether grokking will occur later on.

Memorization

Cannot find the paper you are looking for? You can Submit a new open access paper.