Search Results for author: Prin Phunyaphibarn

Found 1 papers, 0 papers with code

Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study

no code implementations25 Nov 2023 Prin Phunyaphibarn, Junghyun Lee, Bohan Wang, Huishuai Zhang, Chulhee Yun

Although gradient descent with momentum is widely used in modern deep learning, a concrete understanding of its effects on the training trajectory still remains elusive.

Cannot find the paper you are looking for? You can Submit a new open access paper.