no code implementations • 6 Dec 2023 • Amirhesam Abedsoltan, Parthe Pandit, Luis Rademacher, Mikhail Belkin
Scalable algorithms for learning kernel models need to be iterative in nature, but convergence can be slow due to poor conditioning.
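As a minimal illustration of why conditioning matters for iterative kernel solvers (a toy setup, not the paper's algorithm; the Gaussian kernel, step size, and helper names below are my own assumptions), plain gradient iterations on a kernel least-squares fit slow down as the condition number of the kernel matrix grows:

```python
import numpy as np

def gaussian_kernel(X, Z, bandwidth=1.0):
    # Pairwise squared distances -> Gaussian (RBF) kernel matrix.
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * bandwidth ** 2))

rng = np.random.default_rng(0)
n, d = 500, 10
X = rng.normal(size=(n, d))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=n)

K = gaussian_kernel(X, X)
eigvals = np.linalg.eigvalsh(K)          # ascending order
print("condition number of K:", eigvals[-1] / max(eigvals[0], 1e-12))

# Richardson/gradient iteration on K @ alpha = y; the step size is capped by
# the largest eigenvalue, while progress along directions with tiny
# eigenvalues is extremely slow -- the poor-conditioning problem.
alpha = np.zeros(n)
lr = 1.0 / eigvals[-1]
for _ in range(1000):
    alpha -= lr * (K @ alpha - y)
print("training MSE after 1000 iterations:", np.mean((K @ alpha - y) ** 2))
```

Preconditioning methods aim to damp exactly this dependence on the kernel spectrum.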
1 code implementation • 1 Sep 2023 • Daniel Beaglehole, Adityanarayanan Radhakrishnan, Parthe Pandit, Mikhail Belkin
We then demonstrate the generality of our result by using the patch-based AGOP to enable deep feature learning in convolutional kernel machines.
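For reference, the AGOP that the patch-based construction generalizes is the average outer product of predictor gradients over the data. A minimal non-patch sketch (the Gaussian-kernel predictor, ridge fit, and function names are illustrative assumptions, not the authors' code):

```python
import numpy as np

def rbf(X, Z, h=1.0):
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * h ** 2))

rng = np.random.default_rng(0)
n, d = 200, 8
X = rng.normal(size=(n, d))
y = X[:, 0] * X[:, 1]              # target depends on two coordinates only

h = 2.0
K = rbf(X, X, h)
alpha = np.linalg.solve(K + 1e-3 * np.eye(n), y)   # kernel ridge fit

def grad_f(x):
    # Gradient of f(x) = sum_i alpha_i k(x, X_i) for the Gaussian kernel.
    k = np.exp(-((x - X) ** 2).sum(-1) / (2 * h ** 2))   # shape (n,)
    return ((alpha * k)[:, None] * (X - x) / h ** 2).sum(0)

# Average Gradient Outer Product (AGOP) over the training inputs.
M = np.zeros((d, d))
for x in X:
    g = grad_f(x)
    M += np.outer(g, g)
M /= n
print(np.round(np.diag(M), 3))   # mass concentrates on the relevant coordinates
```

The diagonal of M concentrates on the coordinates the target actually depends on; the paper applies the same quantity patch-wise to obtain feature learning in convolutional kernel machines.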
no code implementations • 14 May 2023 • Evan Becker, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher
We study the local dynamics of gradient descent-ascent (GDA) for training a GAN with a kernel-based discriminator.
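As a point of reference for the dynamics being analyzed, simultaneous GDA alternates a descent step for the generator with an ascent step for the discriminator. The toy below uses a linear critic rather than a kernel one, and all parameter choices are assumptions made for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 2
mu = np.array([1.0, -0.5])       # target the generator should reach
theta = rng.normal(size=d)       # generator parameter (a point mass)
w = np.zeros(d)                  # linear critic weights
lam = 0.1                        # ridge penalty keeping the critic bounded
eta = 0.05

# Min-max objective: the critic w tries to separate the target mu from the
# generated point theta, while the generator moves theta to fool the critic:
#   L(theta, w) = w.(mu - theta) - (lam / 2) * ||w||^2
for _ in range(5000):
    grad_theta = -w                       # dL/dtheta
    grad_w = (mu - theta) - lam * w       # dL/dw
    theta = theta - eta * grad_theta      # descent step for the generator
    w = w + eta * grad_w                  # ascent step for the critic

print("theta:", np.round(theta, 3), " target:", mu)
```

The coupled updates spiral toward the equilibrium rather than descending monotonically, which is what makes the local dynamics worth studying.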
1 code implementation • 6 Feb 2023 • Amirhesam Abedsoltan, Mikhail Belkin, Parthe Pandit
Recent studies indicate that kernel machines can often perform on par with or better than deep neural networks (DNNs) on small datasets.
3 code implementations • 28 Dec 2022 • Adityanarayanan Radhakrishnan, Daniel Beaglehole, Parthe Pandit, Mikhail Belkin
In recent years neural networks have achieved impressive results on many technological and scientific tasks.
no code implementations • 21 Aug 2022 • Evan Becker, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher
Generative Adversarial Networks (GANs) are a widely used tool for generative modeling of complex data.
no code implementations • 14 Jul 2022 • Neil Mallinar, James B. Simon, Amirhesam Abedsoltan, Parthe Pandit, Mikhail Belkin, Preetum Nakkiran
In this work we argue that while benign overfitting has been instructive and fruitful to study, many real interpolating methods like neural networks do not fit benignly: modest noise in the training set causes nonzero (but non-infinite) excess risk at test time, implying these models are neither benign nor catastrophic but rather fall in an intermediate regime.
no code implementations • 30 Jun 2022 • Libin Zhu, Parthe Pandit, Mikhail Belkin
In this work we show that linear networks with a bottleneck layer learn bilinear functions of the weights, in a ball of radius $O(1)$ around initialization.
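For intuition (a minimal two-layer illustration of the bilinear structure, not the paper's general statement about deeper bottleneck networks): a linear network with weights $W_1, W_2$ computes

$$ f_{W_1, W_2}(x) = W_2 W_1 x, $$

which is linear in $W_1$ for fixed $W_2$ and linear in $W_2$ for fixed $W_1$, i.e. bilinear in the weights; the result above says this is the structure such networks effectively learn within an $O(1)$ neighborhood of initialization.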
no code implementations • 26 May 2022 • Daniel Beaglehole, Mikhail Belkin, Parthe Pandit
"Benign overfitting", the ability of certain algorithms to interpolate noisy training data and yet perform well out-of-sample, has been a topic of considerable recent interest.
no code implementations • 20 Jan 2022 • Mojtaba Sahraee-Ardakan, Melikasadat Emami, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher
Empirical observations of high-dimensional phenomena, such as double descent, have attracted considerable interest in understanding classical techniques such as kernel methods and their implications for explaining the generalization properties of neural networks.
no code implementations • 19 Jan 2021 • Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher
The degree of this bias depends on the variance of the transition kernel matrix at initialization and is related to the classic exploding and vanishing gradients problem.
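A minimal sketch of the exploding/vanishing mechanism referenced here (a toy linear recursion, not the paper's analysis; the matrix sizes and variances are assumptions): with i.i.d. entries of variance $\sigma^2/n$, the transition matrix has spectral radius roughly $\sigma$, so repeated application shrinks or blows up signals, and hence gradients, depending on the initialization variance:

```python
import numpy as np

rng = np.random.default_rng(0)
n, T = 256, 60

for sigma in (0.7, 1.0, 1.3):
    # Transition matrix with i.i.d. entries of variance sigma^2 / n,
    # so its spectral radius is approximately sigma.
    W = rng.normal(scale=sigma / np.sqrt(n), size=(n, n))
    v = rng.normal(size=n)
    v /= np.linalg.norm(v)
    for _ in range(T):
        v = W @ v                 # one recurrent step
    print(f"sigma={sigma}: norm after {T} steps = {np.linalg.norm(v):.2e}")
```

Variances below one lead to vanishing signals and variances above one to exploding ones, which is the classic trade-off the quoted sentence points to.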
1 code implementation • NeurIPS 2020 • Parthe Pandit, Mojtaba Sahraee-Ardakan, Sundeep Rangan, Philip Schniter, Alyson K. Fletcher
In the two-layer neural-network learning problem, this scaling corresponds to the case where the numbers of input features and training samples grow to infinity while the number of hidden nodes stays fixed.
no code implementations • 6 May 2020 • Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Alyson K. Fletcher, Sundeep Rangan, Michael Trumpis, Brinnae Bent, Chia-Han Chiang, Jonathan Viventi
This decoding problem is particularly challenging due to the complexity of neural responses in the auditory cortex and the presence of confounding signals in awake animals.
3 code implementations • ICML 2020 • Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher
We provide a general framework to characterize the asymptotic generalization error for single-layer neural networks (i.e., generalized linear models) with arbitrary non-linearities, making it applicable to regression as well as classification problems.
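Concretely, the quantity being characterized is the test error of an estimator for a generalized linear model in the proportional regime where $n/d$ is fixed. A Monte Carlo sketch of that quantity (ridge is just one possible estimator, and the non-linearity, noise level, and names are illustrative assumptions rather than the paper's formula):

```python
import numpy as np

rng = np.random.default_rng(0)
d = 200
ratio = 2.0                        # samples-per-dimension ratio n/d
n = int(ratio * d)

w_star = rng.normal(size=d) / np.sqrt(d)
phi = np.tanh                      # an arbitrary output non-linearity

def sample(m):
    X = rng.normal(size=(m, d))
    y = phi(X @ w_star) + 0.1 * rng.normal(size=m)
    return X, y

X, y = sample(n)
# A simple ridge-regression estimate (one possible estimator; the framework
# in the paper covers a general class of estimators and non-linearities).
lam = 1e-2
w_hat = np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

X_test, y_test = sample(10_000)
gen_err = np.mean((phi(X_test @ w_hat) - y_test) ** 2)
print(f"Monte Carlo generalization error at n/d={ratio}: {gen_err:.3f}")
```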
no code implementations • 26 Jan 2020 • Parthe Pandit, Mojtaba Sahraee-Ardakan, Sundeep Rangan, Philip Schniter, Alyson K. Fletcher
We consider the problem of inferring the input and hidden variables of a stochastic multi-layer neural network from an observation of the output.
no code implementations • 8 Nov 2019 • Parthe Pandit, Mojtaba Sahraee-Ardakan, Sundeep Rangan, Philip Schniter, Alyson K. Fletcher
This paper presents a novel algorithm, Multi-Layer Vector Approximate Message Passing (ML-VAMP), for inference in multi-layer stochastic neural networks.
no code implementations • 19 Mar 2019 • Parthe Pandit, Mojtaba Sahraee-Ardakan, Arash A. Amini, Sundeep Rangan, Alyson K. Fletcher
We derive precise upper bounds on the mean-squared estimation error in terms of the number of samples, dimensions of the process, the lag $p$ and other key statistical properties of the model.
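For context, a minimal linear-Gaussian VAR($p$) version of the estimation problem such bounds concern (least squares over stacked lags; the model, dimensions, and noise level are assumptions made for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
d, p, T = 3, 2, 2000              # process dimension, lag, sample length

# Ground-truth VAR(p) coefficient matrices, scaled for stability.
A_true = [0.3 * rng.normal(size=(d, d)) / np.sqrt(d) for _ in range(p)]

x = np.zeros((T, d))
for t in range(p, T):
    x[t] = sum(A_true[k] @ x[t - 1 - k] for k in range(p))
    x[t] += 0.1 * rng.normal(size=d)

# Least-squares estimation: regress x_t on the stacked p previous values.
Y = x[p:]                                                     # (T-p, d)
Z = np.hstack([x[p - 1 - k: T - 1 - k] for k in range(p)])    # (T-p, p*d)
A_hat = np.linalg.lstsq(Z, Y, rcond=None)[0].T                # (d, p*d)

A_stack = np.hstack(A_true)                                   # (d, p*d)
mse = np.mean((A_hat - A_stack) ** 2)
print(f"mean-squared estimation error: {mse:.2e}")
```

Running the sketch with larger $T$ or smaller lag $p$ shrinks the error, which is the qualitative dependence the bounds quantify.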
no code implementations • 1 Mar 2019 • Parthe Pandit, Mojtaba Sahraee, Sundeep Rangan, Alyson K. Fletcher
Deep generative priors are a powerful tool for reconstruction problems with complex data such as images and text.