Search Results for author: Kartik Ahuja

Found 32 papers, 20 papers with code

Robust Data Pruning: Uncovering and Overcoming Implicit Bias

2 code implementations8 Apr 2024 Artem Vysogorets, Kartik Ahuja, Julia Kempe

However, little is known about its impact on classification bias of the trained models.

Fairness

On Provable Length and Compositional Generalization

no code implementations7 Feb 2024 Kartik Ahuja, Amin Mansouri

Length generalization -- the ability to generalize to longer sequences than ones seen during training, and compositional generalization -- the ability to generalize to token combinations not seen during training, are crucial forms of out-of-distribution generalization in sequence-to-sequence models.

Out-of-Distribution Generalization

Multi-Domain Causal Representation Learning via Weak Distributional Invariances

no code implementations4 Oct 2023 Kartik Ahuja, Amin Mansouri, Yixin Wang

Causal representation learning has emerged as the center of action in causal machine learning research.

Representation Learning

On the Identifiability of Quantized Factors

1 code implementation28 Jun 2023 Vitória Barin-Pacela, Kartik Ahuja, Simon Lacoste-Julien, Pascal Vincent

We introduce this novel form of identifiability, termed quantized factor identifiability, and provide a comprehensive proof of the recovery of the quantized factors.

Disentanglement Inductive Bias

A Closer Look at In-Context Learning under Distribution Shifts

1 code implementation26 May 2023 Kartik Ahuja, David Lopez-Paz

In-context learning, a capability that enables a model to learn from input examples on the fly without necessitating weight updates, is a defining characteristic of large language models.

In-Context Learning

Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization

1 code implementation20 Dec 2022 Alexandre Ramé, Kartik Ahuja, Jianyu Zhang, Matthieu Cord, Léon Bottou, David Lopez-Paz

In this paper, we thus propose model ratatouille, a new strategy to recycle the multiple fine-tunings of the same foundation model on diverse auxiliary tasks.

Domain Generalization Out-of-Distribution Generalization

FL Games: A Federated Learning Framework for Distribution Shifts

no code implementations31 Oct 2022 Sharut Gupta, Kartik Ahuja, Mohammad Havaei, Niladri Chatterjee, Yoshua Bengio

Federated learning aims to train predictive models for data that is distributed across clients, under the orchestration of a server.

Federated Learning

Weakly Supervised Representation Learning with Sparse Perturbations

1 code implementation2 Jun 2022 Kartik Ahuja, Jason Hartford, Yoshua Bengio

We show that if the perturbations are applied only on mutually exclusive blocks of latents, we identify the latents up to those blocks.

Representation Learning

Why does Throwing Away Data Improve Worst-Group Error?

no code implementations23 May 2022 Kamalika Chaudhuri, Kartik Ahuja, Martin Arjovsky, David Lopez-Paz

When facing data with imbalanced classes or groups, practitioners follow an intriguing strategy to achieve best results.

Fairness imbalanced classification +1

FL Games: A federated learning framework for distribution shifts

no code implementations23 May 2022 Sharut Gupta, Kartik Ahuja, Mohammad Havaei, Niladri Chatterjee, Yoshua Bengio

Federated learning aims to train predictive models for data that is distributed across clients, under the orchestration of a server.

Federated Learning

Towards efficient representation identification in supervised learning

1 code implementation10 Apr 2022 Kartik Ahuja, Divyat Mahajan, Vasilis Syrgkanis, Ioannis Mitliagkas

In this work, we depart from these assumptions and ask: a) How can we get disentanglement when the auxiliary information does not provide conditional independence over the factors of variation?

Disentanglement

Properties from Mechanisms: An Equivariance Perspective on Identifiable Representation Learning

no code implementations ICLR 2022 Kartik Ahuja, Jason Hartford, Yoshua Bengio

These results suggest that by exploiting inductive biases on mechanisms, it is possible to design a range of new identifiable representation learning approaches.

Representation Learning

Locally Invariant Explanations: Towards Causal Explanations through Local Invariant Learning

no code implementations29 Sep 2021 Amit Dhurandhar, Karthikeyan Natesan Ramamurthy, Kartik Ahuja, Vijay Arya

Locally interpretable model agnostic explanations (LIME) method is one of the most popular methods used to explain black-box models at a per example level.

Out-of-Distribution Generalization

Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge

1 code implementation22 Jun 2021 Abhin Shah, Karthikeyan Shanmugam, Kartik Ahuja

Our main result strengthens these prior results by showing that under a different expert-driven structural knowledge -- that one variable is a direct causal parent of treatment variable -- remarkably, testing for subsets (not involving the known parent variable) that are valid back-doors is equivalent to an invariance test.

Causal Inference Representation Learning +1

Can Subnetwork Structure be the Key to Out-of-Distribution Generalization?

no code implementations5 Jun 2021 Dinghuai Zhang, Kartik Ahuja, Yilun Xu, Yisen Wang, Aaron Courville

Can models with particular structure avoid being biased towards spurious correlation in out-of-distribution (OOD) generalization?

Out-of-Distribution Generalization

SAND-mask: An Enhanced Gradient Masking Strategy for the Discovery of Invariances in Domain Generalization

2 code implementations4 Jun 2021 Soroosh Shahtalebi, Jean-Christophe Gagnon-Audet, Touraj Laleh, Mojtaba Faramarzi, Kartik Ahuja, Irina Rish

A major bottleneck in the real-world applications of machine learning models is their failure in generalizing to unseen domains whose data distribution is not i. i. d to the training domains.

Domain Generalization

Treatment Effect Estimation using Invariant Risk Minimization

2 code implementations13 Mar 2021 Abhin Shah, Kartik Ahuja, Karthikeyan Shanmugam, Dennis Wei, Kush Varshney, Amit Dhurandhar

Inferring causal individual treatment effect (ITE) from observational data is a challenging problem whose difficulty is exacerbated by the presence of treatment assignment bias.

Domain Generalization regression

Learning to Initialize Gradient Descent Using Gradient Descent

no code implementations22 Dec 2020 Kartik Ahuja, Amit Dhurandhar, Kush R. Varshney

Non-convex optimization problems are challenging to solve; the success and computational expense of a gradient descent algorithm or variant depend heavily on the initialization strategy.

Empirical or Invariant Risk Minimization? A Sample Complexity Perspective

3 code implementations ICLR 2021 Kartik Ahuja, Jun Wang, Amit Dhurandhar, Karthikeyan Shanmugam, Kush R. Varshney

Recently, invariant risk minimization (IRM) was proposed as a promising solution to address out-of-distribution (OOD) generalization.

Linear Regression Games: Convergence Guarantees to Approximate Out-of-Distribution Solutions

3 code implementations28 Oct 2020 Kartik Ahuja, Karthikeyan Shanmugam, Amit Dhurandhar

In Ahuja et al., it was shown that solving for the Nash equilibria of a new class of "ensemble-games" is equivalent to solving IRM.

regression

Adversarial Feature Desensitization

1 code implementation NeurIPS 2021 Pouya Bashivan, Reza Bayat, Adam Ibrahim, Kartik Ahuja, Mojtaba Faramarzi, Touraj Laleh, Blake Aaron Richards, Irina Rish

Our method, called Adversarial Feature Desensitization (AFD), aims at learning features that are invariant towards adversarial perturbations of the inputs.

Adversarial Robustness Domain Adaptation +1

Invariant Risk Minimization Games

3 code implementations ICML 2020 Kartik Ahuja, Karthikeyan Shanmugam, Kush R. Varshney, Amit Dhurandhar

The standard risk minimization paradigm of machine learning is brittle when operating in environments whose test distributions are different from the training distribution due to spurious correlations.

BIG-bench Machine Learning Image Classification

Estimating Kullback-Leibler Divergence Using Kernel Machines

1 code implementation2 May 2019 Kartik Ahuja

Recently, a method called the Mutual Information Neural Estimator (MINE) that uses neural networks has been proposed to estimate mutual information and more generally the Kullback-Leibler (KL) divergence between two distributions.

Risk-Stratify: Confident Stratification Of Patients Based On Risk

no code implementations2 Nov 2018 Kartik Ahuja, Mihaela van der Schaar

A clinician desires to use a risk-stratification method that achieves confident risk-stratification - the risk estimates of the different patients reflect the true risks with a high probability.

Joint Concordance Index

1 code implementation26 Oct 2018 Kartik Ahuja, Mihaela van der Schaar

We use the new metric to develop a variable importance ranking approach.

Methodology

Optimal Piecewise Local-Linear Approximations

1 code implementation27 Jun 2018 Kartik Ahuja, William Zame, Mihaela van der Schaar

Piecewise local-linear models provide a natural way to extend local-linear models to explain the global behavior of the model.

Clustering

DPSCREEN: Dynamic Personalized Screening

no code implementations NeurIPS 2017 Kartik Ahuja, William Zame, Mihaela van der Schaar

However, there has been limited work to address the personalized screening for these different diseases.

Cannot find the paper you are looking for? You can Submit a new open access paper.