Search Results for author: Robin Yadav

Found 1 papers, 0 papers with code

Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models

no code implementations29 Feb 2024 Frederik Kunstner, Robin Yadav, Alan Milligan, Mark Schmidt, Alberto Bietti

We show that the heavy-tailed class imbalance found in language modeling tasks leads to difficulties in the optimization dynamics.

Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.