1 code implementation • 27 Feb 2024 • Patrick Pynadath, Riddhiman Bhattacharya, Arun Hariharan, Ruqi Zhang
Discrete distributions, particularly in high-dimensional deep models, are often highly multimodal due to inherent discontinuities.
no code implementations • 14 Dec 2021 • Riddhiman Bhattacharya, Tiefeng Jiang
Past research has indicated that the covariance of the Stochastic Gradient Descent (SGD) error done via minibatching plays a critical role in determining its regularization and escape from low potential points.