Search Results for author: Aditya Cowsik

Found 3 papers, 0 papers with code

Geometric Dynamics of Signal Propagation Predict Trainability of Transformers

no code implementations5 Mar 2024 Aditya Cowsik, Tamra Nebabu, Xiao-Liang Qi, Surya Ganguli

Our update equations show that without MLP layers, this system will collapse to a line, consistent with prior work on rank collapse in transformers.

Flatter, faster: scaling momentum for optimal speedup of SGD

no code implementations28 Oct 2022 Aditya Cowsik, Tankut Can, Paolo Glorioso

Commonly used optimization algorithms often show a trade-off between good generalization and fast training times.

Breast Cancer Diagnosis by Higher-Order Probabilistic Perceptrons

no code implementations15 Dec 2019 Aditya Cowsik, John W. Clark

The present machine-learning approach to diagnosis (known as HOPP, for higher-order probabilistic perceptron) is tested on the much-studied, open-access Breast Cancer Wisconsin (Diagnosis) Data Set of Wolberg et al.

BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.