Search Results for author: Bernhard Schölkopf

Found 343 papers, 142 papers with code

Cooperate or Collapse: Emergence of Sustainability Behaviors in a Society of LLM Agents

no code implementations • 25 Apr 2024 • Giorgio Piatti, Zhijing Jin, Max Kleiman-Weiner, Bernhard Schölkopf, Mrinmaya Sachan, Rada Mihalcea

Through this simulation environment, we explore the dynamics of resource sharing among AI agents, highlighting the importance of ethical considerations, strategic planning, and negotiation skills.

Compete and Compose: Learning Independent Mechanisms for Modular World Models

no code implementations • 23 Apr 2024 • Anson Lei, Frederik Nolte, Bernhard Schölkopf, Ingmar Posner

COMET is trained on multiple environments with varying dynamics via a two-step process: competition and composition.

A diverse Multilingual News Headlines Dataset from around the World

1 code implementation • 28 Mar 2024 • Felix Leeb, Bernhard Schölkopf

Babel Briefings is a novel dataset featuring 4.7 million news headlines from August 2020 to November 2021, across 30 languages and 54 locations worldwide, with English translations of all articles included.

Language Models Can Reduce Asymmetry in Information Markets

no code implementations • 21 Mar 2024 • Nasim Rahaman, Martin Weiss, Manuel Wüthrich, Yoshua Bengio, Li Erran Li, Chris Pal, Bernhard Schölkopf

This work addresses the buyer's inspection paradox for information markets.

Provable Privacy with Non-Private Pre-Processing

no code implementations • 19 Mar 2024 • Yaxi Hu, Amartya Sanyal, Bernhard Schölkopf

When analysing Differentially Private (DP) machine learning pipelines, the potential privacy cost of data-dependent pre-processing is frequently overlooked in privacy accounting.

Imputation Quantization

Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends

no code implementations • 12 Mar 2024 • Sidak Pal Singh, Bobby He, Thomas Hofmann, Bernhard Schölkopf

We propose a fresh take on understanding the mechanisms of neural networks by analyzing the rich structure of parameters contained within their optimization trajectories.

Skill or Luck? Return Decomposition via Advantage Functions

no code implementations • 20 Feb 2024 • Hsiao-Ru Pan, Bernhard Schölkopf

Learning from off-policy data is essential for sample-efficient reinforcement learning.

Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals

1 code implementation • 18 Feb 2024 • Francesco Ortu, Zhijing Jin, Diego Doimo, Mrinmaya Sachan, Alberto Cazzaniga, Bernhard Schölkopf

Interpretability research aims to bridge the gap between the empirical success and our scientific understanding of the inner workings of large language models (LLMs).

Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models

no code implementations • 14 Feb 2024 • Goutham Rajendran, Simon Buchholz, Bryon Aragam, Bernhard Schölkopf, Pradeep Ravikumar

In this work, we relate these two approaches and study how to learn human-interpretable concepts from data.

Representation Learning

Limits of Transformer Language Models on Learning Algorithmic Compositions

no code implementations • 8 Feb 2024 • Jonathan Thomm, Aleksandar Terzic, Geethan Karunaratne, Giacomo Camposampiero, Bernhard Schölkopf, Abbas Rahimi

We analyze the capabilities of Transformer language models on learning discrete algorithms.

The Essential Role of Causality in Foundation World Models for Embodied AI

no code implementations • 6 Feb 2024 • Tarun Gupta, Wenbo Gong, Chao Ma, Nick Pawlowski, Agrin Hilmkil, Meyer Scetbon, Ade Famoti, Ashley Juan Llorens, Jianfeng Gao, Stefan Bauer, Danica Kragic, Bernhard Schölkopf, Cheng Zhang

This paper focuses on the prospects of building foundation world models for the upcoming generation of embodied agents and presents a novel viewpoint on the significance of causality within these.

Misconceptions

A Probabilistic Model to explain Self-Supervised Representation Learning

no code implementations • 2 Feb 2024 • Alice Bizeul, Bernhard Schölkopf, Carl Allen

Self-supervised learning (SSL) learns representations by leveraging an auxiliary unsupervised task, such as classifying semantically related samples, e.g., different data augmentations or modalities.

Representation Learning Self-Supervised Learning

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

no code implementations • 31 Jan 2024 • Andreas Opedal, Alessandro Stolfo, Haruki Shirakami, Ying Jiao, Ryan Cotterell, Bernhard Schölkopf, Abulhair Saparov, Mrinmaya Sachan

We find evidence that LLMs, with and without instruction-tuning, exhibit human-like biases in both the text-comprehension and the solution-planning steps of the solving process, but not during the final step which relies on the problem's arithmetic expressions (solution execution).

Reading Comprehension

Identifying Policy Gradient Subspaces

no code implementations • 12 Jan 2024 • Jan Schneider, Pierre Schumacher, Simon Guist, Le Chen, Daniel Häufle, Bernhard Schölkopf, Dieter Büchler

Policy gradient methods hold great potential for solving complex continuous control tasks.

Continuous Control Policy Gradient Methods +1

RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks

no code implementations • 11 Jan 2024 • Partha Ghosh, Soubhik Sanyal, Cordelia Schmid, Bernhard Schölkopf

To capture these dependencies, our approach incorporates a hybrid explicit-implicit tri-plane representation inspired by 3D-aware generative frameworks developed for three-dimensional object representation and employs a singular latent code to model an entire video sequence.

Generative Adversarial Network Optical Flow Estimation +1

Independent Mechanism Analysis and the Manifold Hypothesis

no code implementations • 20 Dec 2023 • Shubhangi Ghosh, Luigi Gresele, Julius von Kügelgen, Michel Besserve, Bernhard Schölkopf

As is typical in ICA, previous work focused on the case with an equal number of latent components and observed mixtures.

Representation Learning

Inferring Atmospheric Properties of Exoplanets with Flow Matching and Neural Importance Sampling

no code implementations • 13 Dec 2023 • Timothy D. Gebhard, Jonas Wildberger, Maximilian Dax, Daniel Angerhausen, Sascha P. Quanz, Bernhard Schölkopf

Atmospheric retrievals (AR) characterize exoplanets by estimating atmospheric parameters from observed light spectra, typically by framing the task as a Bayesian inference problem.

Bayesian Inference

CLadder: Assessing Causal Reasoning in Language Models

1 code implementation • NeurIPS 2023 • Zhijing Jin, Yuen Chen, Felix Leeb, Luigi Gresele, Ojasv Kamal, Zhiheng Lyu, Kevin Blin, Fernando Gonzalez Adauto, Max Kleiman-Weiner, Mrinmaya Sachan, Bernhard Schölkopf

Much of the existing work in natural language processing (NLP) focuses on evaluating commonsense causal reasoning in LLMs, thus failing to assess whether a model can perform causal inference in accordance with a set of well-defined formal rules.

Causal Inference Commonsense Causal Reasoning +1

Targeted Reduction of Causal Models

no code implementations • 30 Nov 2023 • Armin Kekić, Bernhard Schölkopf, Michel Besserve

Why does a phenomenon occur?

GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs

no code implementations • 30 Nov 2023 • Gege Gao, Weiyang Liu, Anpei Chen, Andreas Geiger, Bernhard Schölkopf

As pretrained text-to-image diffusion models become increasingly powerful, recent efforts have been made to distill knowledge from these text-to-image pretrained models for optimizing a text-guided 3D model.

Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations

no code implementations • 15 Nov 2023 • Cian Eastwood, Julius von Kügelgen, Linus Ericsson, Diane Bouchacourt, Pascal Vincent, Bernhard Schölkopf, Mark Ibrahim

Self-supervised representation learning often uses data augmentations to induce some invariance to "style" attributes of the data.

Data Augmentation Disentanglement

Navigating the Ocean of Biases: Political Bias Attribution in Language Models via Causal Structures

1 code implementation • 15 Nov 2023 • David F. Jenny, Yann Billeter, Mrinmaya Sachan, Bernhard Schölkopf, Zhijing Jin

The rapid advancement of Large Language Models (LLMs) has sparked intense debate regarding their ability to perceive and interpret complex socio-political landscapes.

Decision Making

Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization

1 code implementation • 10 Nov 2023 • Weiyang Liu, Zeju Qiu, Yao Feng, Yuliang Xiu, Yuxuan Xue, Longhui Yu, Haiwen Feng, Zhen Liu, Juyeon Heo, Songyou Peng, Yandong Wen, Michael J. Black, Adrian Weller, Bernhard Schölkopf

We apply this parameterization to OFT, creating a novel parameter-efficient finetuning method, called Orthogonal Butterfly (BOFT).

CausalCite: A Causal Formulation of Paper Citations

1 code implementation • 5 Nov 2023 • Ishan Kumar, Zhijing Jin, Ehsan Mokhtarian, Siyuan Guo, Yuen Chen, Mrinmaya Sachan, Bernhard Schölkopf

Evaluating the significance of a paper is pivotal yet challenging for the scientific community.

Causal Inference counterfactual

Causal Modeling with Stationary Diffusions

1 code implementation • 26 Oct 2023 • Lars Lorch, Andreas Krause, Bernhard Schölkopf

We develop a novel approach towards causal inference.

Causal Inference

Ghost on the Shell: An Expressive Representation of General 3D Shapes

no code implementations • 23 Oct 2023 • Zhen Liu, Yao Feng, Yuliang Xiu, Weiyang Liu, Liam Paull, Michael J. Black, Bernhard Schölkopf

Recent work has focused on the former, and methods for reconstructing open surfaces do not support fast reconstruction with material and lighting or unconditional generative modelling.

Pairwise Similarity Learning is SimPLE

2 code implementations • ICCV 2023 • Yandong Wen, Weiyang Liu, Yao Feng, Bhiksha Raj, Rita Singh, Adrian Weller, Michael J. Black, Bernhard Schölkopf

In this paper, we focus on a general yet important learning problem, pairwise similarity learning (PSL).

Face Recognition Image Retrieval +4

Deep Backtracking Counterfactuals for Causally Compliant Explanations

no code implementations • 11 Oct 2023 • Klaus-Rudolf Kladny, Julius von Kügelgen, Bernhard Schölkopf, Michael Muehlebach

Counterfactuals answer questions of what would have been observed under altered circumstances and can therefore offer valuable insights.

counterfactual Philosophy

Borges and AI

no code implementations • 27 Sep 2023 • Léon Bottou, Bernhard Schölkopf

Many believe that Large Language Models (LLMs) open the era of Artificial Intelligence (AI).

Language Modelling

Investigating the Impact of Action Representations in Policy Gradient Algorithms

no code implementations • 13 Sep 2023 • Jan Schneider, Pierre Schumacher, Daniel Häufle, Bernhard Schölkopf, Dieter Büchler

Reinforcement learning (RL) is a versatile framework for learning to solve complex real-world tasks.

reinforcement-learning Reinforcement Learning (RL)

Parameterizing pressure-temperature profiles of exoplanet atmospheres with neural networks

1 code implementation • 6 Sep 2023 • Timothy D. Gebhard, Daniel Angerhausen, Björn S. Konrad, Eleonora Alei, Sascha P. Quanz, Bernhard Schölkopf

When training and evaluating our method on two publicly available datasets of self-consistent PT profiles, we find that our method achieves, on average, better fit quality than existing baseline methods, despite using fewer parameters.

Bayesian Inference

SE(3) Equivariant Augmented Coupling Flows

1 code implementation • NeurIPS 2023 • Laurence I. Midgley, Vincent Stimper, Javier Antorán, Emile Mathieu, Bernhard Schölkopf, José Miguel Hernández-Lobato

Coupling normalizing flows allow for fast sampling and density evaluation, making them the tool of choice for probabilistic modeling of physical systems.

Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World

no code implementations • 15 Aug 2023 • Nico Gürtler, Felix Widmaier, Cansu Sancaktar, Sebastian Blaes, Pavel Kolev, Stefan Bauer, Manuel Wüthrich, Markus Wulfmeier, Martin Riedmiller, Arthur Allshire, Qiang Wang, Robert McCarthy, Hangyeol Kim, Jongchan Baek, Wookyong Kwon, Shanliang Qian, Yasunori Toshimitsu, Mike Yan Michelis, Amirhossein Kazemipour, Arman Raayatsanati, Hehui Zheng, Barnabas Gavin Cangan, Bernhard Schölkopf, Georg Martius

For this reason, a large part of the reinforcement learning (RL) community uses simulators to develop and benchmark algorithms.

Offline RL reinforcement-learning +1

Benchmarking Offline Reinforcement Learning on Real-Robot Hardware

2 code implementations • 28 Jul 2023 • Nico Gürtler, Sebastian Blaes, Pavel Kolev, Felix Widmaier, Manuel Wüthrich, Stefan Bauer, Bernhard Schölkopf, Georg Martius

To coordinate the efforts of the research community toward tackling this problem, we propose a benchmark including: i) a large collection of data for offline learning from a dexterous manipulation platform on two tasks, obtained with capable RL agents trained in simulation; ii) the option to execute learned policies on a real-world robotic system and a simulation for efficient debugging.

Benchmarking reinforcement-learning

Spuriosity Didn't Kill the Classifier: Using Invariant Predictions to Harness Spurious Features

no code implementations • 19 Jul 2023 • Cian Eastwood, Shashank Singh, Andrei Liviu Nicolicioiu, Marin Vlastelica, Julius von Kügelgen, Bernhard Schölkopf

To avoid failures on out-of-distribution data, recent works have sought to extract features that have an invariant or stable relationship with the label across domains, discarding "spurious" or unstable features whose relationship with the label changes across domains.

The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks

1 code implementation • 14 Jun 2023 • Aaron Spieler, Nasim Rahaman, Georg Martius, Bernhard Schölkopf, Anna Levina

Biological cortical neurons are remarkably sophisticated computational devices, temporally integrating their vast synaptic input over an intricate dendritic tree, subject to complex, nonlinearly interacting internal biological processes.

Ranked #1 on Time Series on neuronIO

16k Classification +4

Controlling Text-to-Image Diffusion by Orthogonal Finetuning

no code implementations • NeurIPS 2023 • Zeju Qiu, Weiyang Liu, Haiwen Feng, Yuxuan Xue, Yao Feng, Zhen Liu, Dan Zhang, Adrian Weller, Bernhard Schölkopf

To tackle this challenge, we introduce a principled finetuning method -- Orthogonal Finetuning (OFT), for adapting text-to-image diffusion models to downstream tasks.

Causal Effect Estimation from Observational and Interventional Data Through Matrix Weighted Linear Estimators

1 code implementation • 9 Jun 2023 • Klaus-Rudolf Kladny, Julius von Kügelgen, Bernhard Schölkopf, Michael Muehlebach

We study causal effect estimation from a mixture of observational and interventional data in a confounded linear regression model with multivariate treatments.

Can Large Language Models Infer Causation from Correlation?

1 code implementation • 9 Jun 2023 • Zhijing Jin, Jiarui Liu, Zhiheng Lyu, Spencer Poff, Mrinmaya Sachan, Rada Mihalcea, Mona Diab, Bernhard Schölkopf

In this work, we propose the first benchmark dataset to test the pure causal inference skills of large language models (LLMs).

Causal Inference

Stochastic Marginal Likelihood Gradients using Neural Tangent Kernels

1 code implementation • 6 Jun 2023 • Alexander Immer, Tycho F. A. van der Ouderaa, Mark van der Wilk, Gunnar Rätsch, Bernhard Schölkopf

Recent works show that Bayesian model selection with Laplace approximations can allow such hyperparameters to be optimized just like standard neural network parameters, using gradients and on the training data.

Hyperparameter Optimization Model Selection

Learning Linear Causal Representations from Interventions under General Nonlinear Mixing

no code implementations • NeurIPS 2023 • Simon Buchholz, Goutham Rajendran, Elan Rosenfeld, Bryon Aragam, Bernhard Schölkopf, Pradeep Ravikumar

We study the problem of learning causal representations from unknown, latent interventions in a general setting, where the latent distribution is Gaussian but the mixing function is completely general.

counterfactual

Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding

no code implementations • 1 Jun 2023 • Alizée Pace, Hugo Yèche, Bernhard Schölkopf, Gunnar Rätsch, Guy Tennenholtz

A prominent challenge of offline reinforcement learning (RL) is the issue of hidden confounding: unobserved variables may influence both the actions taken by the agent and the observed outcomes.

Management Offline RL +2

Membership Inference Attacks against Language Models via Neighbourhood Comparison

1 code implementation • 29 May 2023 • Justus Mattern, FatemehSadat Mireshghallah, Zhijing Jin, Bernhard Schölkopf, Mrinmaya Sachan, Taylor Berg-Kirkpatrick

To investigate whether this fragility provides a layer of safety, we propose and evaluate neighbourhood attacks, which compare model scores for a given sample to scores of synthetically generated neighbour texts and therefore eliminate the need for access to the training data distribution.

Flow Matching for Scalable Simulation-Based Inference

1 code implementation • NeurIPS 2023 • Maximilian Dax, Jonas Wildberger, Simon Buchholz, Stephen R. Green, Jakob H. Macke, Bernhard Schölkopf

Neural posterior estimation methods based on discrete normalizing flows have become established tools for simulation-based inference (SBI), but scaling them to high-dimensional problems can be challenging.

Causal Component Analysis

1 code implementation • NeurIPS 2023 • Liang Wendong, Armin Kekić, Julius von Kügelgen, Simon Buchholz, Michel Besserve, Luigi Gresele, Bernhard Schölkopf

As a corollary, this interventional perspective also leads to new identifiability results for nonlinear ICA -- a special case of CauCA with an empty graph -- requiring strictly fewer datasets than previous results.

Representation Learning

All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations

1 code implementation • 23 May 2023 • Yuxin Ren, Qipeng Guo, Zhijing Jin, Shauli Ravfogel, Mrinmaya Sachan, Bernhard Schölkopf, Ryan Cotterell

Transformer models bring propelling advances in various NLP tasks, thus inducing lots of interpretability research on the learned representations of the models.

Provably Learning Object-Centric Representations

no code implementations • 23 May 2023 • Jack Brady, Roland S. Zimmermann, Yash Sharma, Bernhard Schölkopf, Julius von Kügelgen, Wieland Brendel

Under this generative process, we prove that the ground-truth object representations can be identified by an invertible and compositional inference model, even in the presence of dependencies between objects.

Object Representation Learning

Estimation Beyond Data Reweighting: Kernel Method of Moments

1 code implementation • 18 May 2023 • Heiner Kremer, Yassine Nemmour, Bernhard Schölkopf, Jia-Jie Zhu

We provide a variant of our estimator for conditional moment restrictions and show that it is asymptotically first-order optimal for such problems.

Causal Inference

The Hessian perspective into the Nature of Convolutional Neural Networks

no code implementations • 16 May 2023 • Sidak Pal Singh, Thomas Hofmann, Bernhard Schölkopf

While Convolutional Neural Networks (CNNs) have long been investigated and applied, as well as theorized, we aim to provide a slightly different perspective into their nature -- through the perspective of their Hessian maps.

Beyond Good Intentions: Reporting the Research Landscape of NLP for Social Good

1 code implementation • 9 May 2023 • Fernando Gonzalez, Zhijing Jin, Bernhard Schölkopf, Tom Hope, Mrinmaya Sachan, Rada Mihalcea

Using state-of-the-art NLP models, we address each of these tasks and use them on the entire ACL Anthology, resulting in a visualization workspace that gives researchers a comprehensive overview of the field of NLP4SG.

Leveraging sparse and shared feature activations for disentangled representation learning

no code implementations • NeurIPS 2023 • Marco Fumero, Florian Wenzel, Luca Zancato, Alessandro Achille, Emanuele Rodolà, Stefano Soatto, Bernhard Schölkopf, Francesco Locatello

Recovering the latent factors of variation of high dimensional data has so far focused on simple synthetic settings.

Representation Learning

Out-of-Variable Generalization for Discriminative Models

no code implementations • 16 Apr 2023 • Siyuan Guo, Jonas Wildberger, Bernhard Schölkopf

The ability of an agent to do well in new environments is a critical aspect of intelligence.

Out-of-Distribution Generalization

Dataflow graphs as complete causal graphs

1 code implementation • 16 Mar 2023 • Andrei Paleyes, Siyuan Guo, Bernhard Schölkopf, Neil D. Lawrence

Component-based development is one of the core principles behind modern software engineering practices.

Generalizing and Decoupling Neural Collapse via Hyperspherical Uniformity Gap

1 code implementation • 11 Mar 2023 • Weiyang Liu, Longhui Yu, Adrian Weller, Bernhard Schölkopf

We then use hyperspherical uniformity (which characterizes the degree of uniformity on the unit hypersphere) as a unified framework to quantify these two objectives.

Hindsight States: Blending Sim and Real Task Elements for Efficient Reinforcement Learning

no code implementations • 3 Mar 2023 • Simon Guist, Jan Schneider, Alexander Dittrich, Vincent Berenz, Bernhard Schölkopf, Dieter Büchler

Reinforcement learning has shown great potential in solving complex tasks when large amounts of data can be generated with little effort.

reinforcement-learning Reinforcement Learning (RL)

Likelihood Annealing: Fast Calibrated Uncertainty for Regression

no code implementations • 21 Feb 2023 • Uddeshya Upadhyay, Jae Myung Kim, Cordelia Schmidt, Bernhard Schölkopf, Zeynep Akata

Recent advances in deep learning have shown that uncertainty estimation is becoming increasingly important in applications such as medical imaging, natural language processing, and autonomous systems.

Denoising Image Super-Resolution +2

On the Interventional Kullback-Leibler Divergence

no code implementations • 10 Feb 2023 • Jonas Wildberger, Siyuan Guo, Arnab Bhattacharyya, Bernhard Schölkopf

Modern machine learning approaches excel in static settings where a large amount of i.i.d.

Robustness Implies Fairness in Causal Algorithmic Recourse

2 code implementations • 7 Feb 2023 • Ahmad-Reza Ehyaei, Amir-Hossein Karimi, Bernhard Schölkopf, Setareh Maghsudi

Algorithmic recourse aims to disclose the inner workings of the black-box decision process in situations where decisions have significant consequences, by providing recommendations to empower beneficiaries to achieve a more favorable outcome.

Adversarial Robustness Fairness

Towards fully covariant machine learning

no code implementations • 31 Jan 2023 • Soledad Villar, David W. Hogg, Weichi Yao, George A. Kevrekidis, Bernhard Schölkopf

We discuss links to causal modeling, and argue that the implementation of passive symmetries is particularly valuable when the goal of the learning problem is to generalize out of sample.

Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion

1 code implementation • 27 Jan 2023 • Flavio Schneider, Ojasv Kamal, Zhijing Jin, Bernhard Schölkopf

Recent years have seen the rapid development of large generative models for text; however, much less research has explored the connection between text and another "language" of communication -- music.

Image Generation Music Generation +1

normflows: A PyTorch Package for Normalizing Flows

1 code implementation • 26 Jan 2023 • Vincent Stimper, David Liu, Andrew Campbell, Vincent Berenz, Lukas Ryll, Bernhard Schölkopf, José Miguel Hernández-Lobato

It allows building normalizing flow models from a suite of base distributions, flow layers, and neural networks.

Image Generation Variational Inference

Multi-Armed Bandits and Quantum Channel Oracles

no code implementations • 20 Jan 2023 • Simon Buchholz, Jonas M. Kübler, Bernhard Schölkopf

Here we introduce further bandit models where we only have limited access to the randomness of the rewards, but we can still query the arms in superposition.

Multi-Armed Bandits reinforcement-learning +1

Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning

1 code implementation • 12 Jan 2023 • Yuejiang Liu, Alexandre Alahi, Chris Russell, Max Horn, Dominik Zietlow, Bernhard Schölkopf, Francesco Locatello

Recent years have seen a surge of interest in learning high-level causal representations from low-level image pairs under interventions.

counterfactual Representation Learning

Understanding Stereotypes in Language Models: Towards Robust Measurement and Zero-Shot Debiasing

no code implementations • 20 Dec 2022 • Justus Mattern, Zhijing Jin, Mrinmaya Sachan, Rada Mihalcea, Bernhard Schölkopf

Generated texts from large pretrained language models have been shown to exhibit a variety of harmful, human-like biases about various demographics.

Benchmarking

Evaluating vaccine allocation strategies using simulation-assisted causal modelling

1 code implementation • 14 Dec 2022 • Armin Kekić, Jonas Dehning, Luigi Gresele, Julius von Kügelgen, Viola Priesemann, Bernhard Schölkopf

Early on during a pandemic, vaccine availability is limited, requiring prioritisation of different population groups.

counterfactual

On the Relationship Between Explanation and Prediction: A Causal View

no code implementations • 13 Dec 2022 • Amir-Hossein Karimi, Krikamol Muandet, Simon Kornblith, Bernhard Schölkopf, Been Kim

Our work borrows tools from causal inference to systematically assay this relationship.

Causal Inference

Adapting to noise distribution shifts in flow-based gravitational-wave inference

no code implementations • 16 Nov 2022 • Jonas Wildberger, Maximilian Dax, Stephen R. Green, Jonathan Gair, Michael Pürrer, Jakob H. Macke, Alessandra Buonanno, Bernhard Schölkopf

Deep learning techniques for gravitational-wave parameter estimation have emerged as a fast alternative to standard samplers – producing results of comparable accuracy.

Federated Causal Discovery From Interventions

3 code implementations • 7 Nov 2022 • Amin Abyaneh, Nino Scherrer, Patrick Schwab, Stefan Bauer, Bernhard Schölkopf, Arash Mehrjou

We propose FedCDI, a federated framework for inferring causal structures from distributed data containing interventional samples.

Causal Discovery Federated Learning +1

A General Purpose Neural Architecture for Geospatial Systems

no code implementations • 4 Nov 2022 • Nasim Rahaman, Martin Weiss, Frederik Träuble, Francesco Locatello, Alexandre Lacoste, Yoshua Bengio, Chris Pal, Li Erran Li, Bernhard Schölkopf

Geospatial Information Systems are used by researchers and Humanitarian Assistance and Disaster Response (HADR) practitioners to support a wide variety of important applications.

Disaster Response Earth Observation +2

Iterative Teaching by Data Hallucination

1 code implementation • 31 Oct 2022 • Zeju Qiu, Weiyang Liu, Tim Z. Xiao, Zhen Liu, Umang Bhatt, Yucen Luo, Adrian Weller, Bernhard Schölkopf

We consider the problem of iterative machine teaching, where a teacher sequentially provides examples based on the status of a learner under a discrete input space (i.e., a pool of finite samples), which greatly limits the teacher's capability.

Hallucination

Spectral Representation Learning for Conditional Moment Models

no code implementations • 29 Oct 2022 • Ziyu Wang, Yucen Luo, Yueru Li, Jun Zhu, Bernhard Schölkopf

For nonparametric conditional moment models, efficient estimation often relies on preimposed conditions on various measures of ill-posedness of the hypothesis space, which are hard to validate when flexible models are used.

Causal Inference Representation Learning

A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models

1 code implementation • 21 Oct 2022 • Alessandro Stolfo, Zhijing Jin, Kumar Shridhar, Bernhard Schölkopf, Mrinmaya Sachan

By grounding the behavioral analysis in a causal graph describing an intuitive reasoning process, we study the behavior of language models in terms of robustness and sensitivity to direct interventions in the input space.

Math Mathematical Reasoning

Neural Attentive Circuits

no code implementations • 14 Oct 2022 • Nasim Rahaman, Martin Weiss, Francesco Locatello, Chris Pal, Yoshua Bengio, Bernhard Schölkopf, Li Erran Li, Nicolas Ballas

Recent work has seen the development of general purpose neural architectures that can be trained to perform tasks across diverse data modalities.

Point Cloud Classification text-classification +1

On the Identifiability and Estimation of Causal Location-Scale Noise Models

1 code implementation • 13 Oct 2022 • Alexander Immer, Christoph Schultheiss, Julia E. Vogt, Bernhard Schölkopf, Peter Bühlmann, Alexander Marx

We study the class of location-scale or heteroscedastic noise models (LSNMs), in which the effect $Y$ can be written as a function of the cause $X$ and a noise source $N$ independent of $X$, which may be scaled by a positive function $g$ over the cause, i.e., $Y = f(X) + g(X)N$.

Causal Discovery Causal Inference
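
To make the model in the abstract above concrete, here is a minimal Python sketch that samples from a location-scale noise model $Y = f(X) + g(X)N$. The particular functions `f` and `g`, the standard-normal distributions chosen for $X$ and $N$, and the helper name `sample_lsnm` are illustrative assumptions, not taken from the paper.

```python
import math
import random


def sample_lsnm(n, f, g, seed=0):
    """Draw n pairs (x, y) with y = f(x) + g(x) * noise, noise independent of x."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)      # cause X (standard normal is an assumption)
        noise = rng.gauss(0.0, 1.0)  # noise N, drawn independently of X
        scale = g(x)
        if scale <= 0:
            raise ValueError("g must be strictly positive")
        pairs.append((x, f(x) + scale * noise))
    return pairs


# Hypothetical mechanism and scale function, chosen only for illustration.
def f(x):
    return math.tanh(x)


def g(x):
    return 0.5 + x * x


data = sample_lsnm(1000, f, g)
```

Dividing the residual `y - f(x)` by `g(x)` recovers the noise term, which is what makes the scaling "heteroscedastic": the spread of $Y$ around $f(X)$ changes with $X$.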

Neural Importance Sampling for Rapid and Reliable Gravitational-Wave Inference

1 code implementation • 11 Oct 2022 • Maximilian Dax, Stephen R. Green, Jonathan Gair, Michael Pürrer, Jonas Wildberger, Jakob H. Macke, Alessandra Buonanno, Bernhard Schölkopf

This shows a median sample efficiency of $\approx 10\%$ (two orders-of-magnitude better than standard samplers) as well as a ten-fold reduction in the statistical uncertainty in the log evidence.
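
As a rough illustration of the quantity reported above, the sketch below computes a sample efficiency from importance weights. It assumes the common definition as effective sample size divided by the number of samples, $\epsilon = (\sum_i w_i)^2 / (n \sum_i w_i^2)$; the function name and the example weights are hypothetical, and the paper's exact definition should be checked against its text.

```python
def sample_efficiency(weights):
    """Return (effective sample size) / n for non-negative importance weights."""
    n = len(weights)
    total = sum(weights)
    sum_sq = sum(w * w for w in weights)
    if n == 0 or sum_sq == 0:
        raise ValueError("need at least one non-zero weight")
    return (total * total) / (n * sum_sq)


# Hypothetical weight vectors, for illustration only.
uniform = sample_efficiency([1.0] * 10)                # all weights equal
degenerate = sample_efficiency([1.0, 0.0, 0.0, 0.0])   # one sample carries all weight
```

Under this definition, equal weights give an efficiency of 1, while a single dominant weight among $n$ samples gives $1/n$.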

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment

1 code implementation • 4 Oct 2022 • Zhijing Jin, Sydney Levine, Fernando Gonzalez, Ojasv Kamal, Maarten Sap, Mrinmaya Sachan, Rada Mihalcea, Josh Tenenbaum, Bernhard Schölkopf

Using a state-of-the-art large language model (LLM) as a basis, we propose a novel moral chain of thought (MORALCOT) prompting strategy that combines the strengths of LLMs with theories of moral reasoning developed in cognitive science to predict human moral judgments.

Language Modelling Large Language Model +1

DCI-ES: An Extended Disentanglement Framework with Connections to Identifiability

1 code implementation • 1 Oct 2022 • Cian Eastwood, Andrei Liviu Nicolicioiu, Julius von Kügelgen, Armin Kekić, Frederik Träuble, Andrea Dittadi, Bernhard Schölkopf

In representation learning, a common approach is to seek representations which disentangle the underlying factors of variation.

Disentanglement Informativeness

Bridging the Gap to Real-World Object-Centric Learning

3 code implementations • 29 Sep 2022 • Maximilian Seitzer, Max Horn, Andrii Zadaianchuk, Dominik Zietlow, Tianjun Xiao, Carl-Johann Simon-Gabriel, Tong He, Zheng Zhang, Bernhard Schölkopf, Thomas Brox, Francesco Locatello

Humans naturally decompose their environment into entities at the appropriate level of abstraction to act in the world.

Object

Function Classes for Identifiable Nonlinear Independent Component Analysis

no code implementations • 12 Aug 2022 • Simon Buchholz, Michel Besserve, Bernhard Schölkopf

Several families of spurious solutions that fit the data perfectly but do not correspond to the ground-truth factors can be constructed in generic settings.

Flow Annealed Importance Sampling Bootstrap

3 code implementations • 3 Aug 2022 • Laurence Illing Midgley, Vincent Stimper, Gregor N. C. Simm, Bernhard Schölkopf, José Miguel Hernández-Lobato

Normalizing flows are tractable density models that can approximate complicated target distributions, e.g., Boltzmann distributions of physical systems.

Homomorphism Autoencoder -- Learning Group Structured Representations from Observed Transitions

1 code implementation • 25 Jul 2022 • Hamza Keurti, Hsiao-Ru Pan, Michel Besserve, Benjamin F. Grewe, Bernhard Schölkopf

How agents can learn internal models that veridically represent interactions with the real world is a largely open question.

Open-Ended Question Answering Representation Learning +1

Discrete Key-Value Bottleneck

1 code implementation • 22 Jul 2022 • Frederik Träuble, Anirudh Goyal, Nasim Rahaman, Michael Mozer, Kenji Kawaguchi, Yoshua Bengio, Bernhard Schölkopf

Deep neural networks perform well on classification tasks where data streams are i.i.d.

Class Incremental Learning Incremental Learning

Probable Domain Generalization via Quantile Risk Minimization

2 code implementations • 20 Jul 2022 • Cian Eastwood, Alexander Robey, Shashank Singh, Julius von Kügelgen, Hamed Hassani, George J. Pappas, Bernhard Schölkopf

By minimizing the $\alpha$-quantile of the predictor's risk distribution over domains, QRM seeks predictors that perform well with probability $\alpha$.

Domain Generalization

Structural Causal 3D Reconstruction

no code implementations • 20 Jul 2022 • Weiyang Liu, Zhen Liu, Liam Paull, Adrian Weller, Bernhard Schölkopf

This paper considers the problem of unsupervised 3D object reconstruction from in-the-wild single-view images.

3D Object Reconstruction 3D Reconstruction +2

Assaying Out-Of-Distribution Generalization in Transfer Learning

1 code implementation • 19 Jul 2022 • Florian Wenzel, Andrea Dittadi, Peter Vincent Gehler, Carl-Johann Simon-Gabriel, Max Horn, Dominik Zietlow, David Kernert, Chris Russell, Thomas Brox, Bernt Schiele, Bernhard Schölkopf, Francesco Locatello

Since out-of-distribution generalization is a generally ill-posed problem, various proxy targets (e.g., calibration, adversarial robustness, algorithmic corruptions, invariance across shifts) were studied across different research programs resulting in different recommendations.

Adversarial Robustness Out-of-Distribution Generalization +1

Probing the Robustness of Independent Mechanism Analysis for Representation Learning

no code implementations • 13 Jul 2022 • Joanna Sliwa, Shubhangi Ghosh, Vincent Stimper, Luigi Gresele, Bernhard Schölkopf

One aim of representation learning is to recover the original latent code that generated the data, a task which requires additional information or inductive biases.

Representation Learning

Functional Generalized Empirical Likelihood Estimation for Conditional Moment Restrictions

1 code implementation • 11 Jul 2022 • Heiner Kremer, Jia-Jie Zhu, Krikamol Muandet, Bernhard Schölkopf

Important problems in causal inference, economics, and, more generally, robust machine learning can be expressed as conditional moment restrictions, but estimation becomes challenging as it requires solving a continuum of unconditional moment restrictions.

BIG-bench Machine Learning Causal Inference

Variational Causal Dynamics: Discovering Modular World Models from Interventions

no code implementations • 22 Jun 2022 • Anson Lei, Bernhard Schölkopf, Ingmar Posner

In doing so, VCD significantly extends the capabilities of the current state-of-the-art in latent world models while also comparing favourably in terms of prediction accuracy.

Causal Discovery Variational Inference

AutoML Two-Sample Test

3 code implementations • 17 Jun 2022 • Jonas M. Kübler, Vincent Stimper, Simon Buchholz, Krikamol Muandet, Bernhard Schölkopf

Two-sample tests are important in statistics and machine learning, both as tools for scientific discovery as well as to detect distribution shifts.

AutoML Two-sample testing +1

Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization

no code implementations • 7 Jun 2022 • Aniket Das, Bernhard Schölkopf, Michael Muehlebach

We obtain tight convergence rates for RR and SO and demonstrate that these strategies lead to faster convergence than uniform sampling.

Embrace the Gap: VAEs Perform Independent Mechanism Analysis

1 code implementation • 6 Jun 2022 • Patrik Reizinger, Luigi Gresele, Jack Brady, Julius von Kügelgen, Dominik Zietlow, Bernhard Schölkopf, Georg Martius, Wieland Brendel, Michel Besserve

Leveraging self-consistency, we show that the ELBO converges to a regularized log-likelihood.

Inductive Bias Representation Learning +1

Causal Discovery in Heterogeneous Environments Under the Sparse Mechanism Shift Hypothesis

1 code implementation • 4 Jun 2022 • Ronan Perry, Julius von Kügelgen, Bernhard Schölkopf

data.

Causal Discovery

BaCaDI: Bayesian Causal Discovery with Unknown Interventions

1 code implementation • 3 Jun 2022 • Alexander Hägele, Jonas Rothfuss, Lars Lorch, Vignesh Ram Somnath, Bernhard Schölkopf, Andreas Krause

Inferring causal structures from experimentation is a central task in many domains.

Causal Discovery Variational Inference

Amortized Inference for Causal Structure Learning

1 code implementation • 25 May 2022 • Lars Lorch, Scott Sussex, Jonas Rothfuss, Andreas Krause, Bernhard Schölkopf

Rather than searching over structures, we train a variational inference model to directly predict the causal structure from observational or interventional data.

Causal Discovery Inductive Bias +1

Original or Translated? A Causal Analysis of the Impact of Translationese on Machine Translation Performance

1 code implementation • NAACL 2022 • Jingwei Ni, Zhijing Jin, Markus Freitag, Mrinmaya Sachan, Bernhard Schölkopf

We show that these two factors have a large causal effect on the MT performance, in addition to the test-model direction mismatch highlighted by existing work on the impact of translationese.

Machine Translation Translation

Half-sibling regression meets exoplanet imaging: PSF modeling and subtraction using a flexible, domain knowledge-driven, causal framework

1 code implementation • 7 Apr 2022 • Timothy D. Gebhard, Markus J. Bonse, Sascha P. Quanz, Bernhard Schölkopf

Our HSR-based method provides an alternative, flexible and promising approach to the challenge of modeling and subtracting the stellar PSF and systematic noise in exoplanet imaging data.

Denoising Pupil Tracking +1

From Statistical to Causal Learning

no code implementations • 1 Apr 2022 • Bernhard Schölkopf, Julius von Kügelgen

We describe basic ideas underlying research to build and understand artificially intelligent systems: from symbolic approaches via statistical learning to interventional models relying on concepts of causality.

BIG-bench Machine Learning

Phenomenology of Double Descent in Finite-Width Neural Networks

no code implementations • ICLR 2022 • Sidak Pal Singh, Aurelien Lucchi, Thomas Hofmann, Bernhard Schölkopf

'Double descent' delineates the generalization behaviour of models depending on the regime they belong to: under- or over-parameterized.

Leveling Down in Computer Vision: Pareto Inefficiencies in Fair Deep Classifiers

no code implementations • CVPR 2022 • Dominik Zietlow, Michael Lohaus, Guha Balakrishnan, Matthäus Kleindessner, Francesco Locatello, Bernhard Schölkopf, Chris Russell

Algorithmic fairness is frequently motivated in terms of a trade-off in which overall performance is decreased so as to improve performance on disadvantaged groups where the algorithm would otherwise be less accurate.

Fairness

Score matching enables causal discovery of nonlinear additive noise models

no code implementations • 8 Mar 2022 • Paul Rolland, Volkan Cevher, Matthäus Kleindessner, Chris Russel, Bernhard Schölkopf, Dominik Janzing, Francesco Locatello

This paper demonstrates how to recover causal graphs from the score of the data distribution in non-linear additive (Gaussian) noise models.

Causal Discovery

Interventions, Where and How? Experimental Design for Causal Models at Scale

1 code implementation • 3 Mar 2022 • Panagiotis Tigas, Yashas Annadani, Andrew Jesson, Bernhard Schölkopf, Yarin Gal, Stefan Bauer

Existing methods in experimental design for causal discovery from limited data either rely on linear assumptions for the SCM or select only the intervention target.

Causal Discovery Experimental Design

Logical Fallacy Detection

2 code implementations • 28 Feb 2022 • Zhijing Jin, Abhinav Lalwani, Tejas Vaidhya, Xiaoyu Shen, Yiwen Ding, Zhiheng Lyu, Mrinmaya Sachan, Rada Mihalcea, Bernhard Schölkopf

In this paper, we propose the task of logical fallacy detection, and provide a new dataset (Logic) of logical fallacies generally found in text, together with an additional challenge set for detecting logical fallacies in climate change claims (LogicClimate).

Language Modelling Logical Fallacies +2

On Pitfalls of Identifiability in Unsupervised Learning. A Note on: "Desiderata for Representation Learning: A Causal Perspective"

no code implementations • 14 Feb 2022 • Shubhangi Ghosh, Luigi Gresele, Julius von Kügelgen, Michel Besserve, Bernhard Schölkopf

Model identifiability is a desirable property in the context of unsupervised representation learning.

Representation Learning

Causal Inference Through the Structural Causal Marginal Problem

1 code implementation • 2 Feb 2022 • Luigi Gresele, Julius von Kügelgen, Jonas M. Kübler, Elke Kirschbaum, Bernhard Schölkopf, Dominik Janzing

We introduce an approach to counterfactual inference based on merging information from multiple datasets.

counterfactual Counterfactual Inference

Compositional Multi-Object Reinforcement Learning with Linear Relation Networks

no code implementations • 31 Jan 2022 • Davide Mambelli, Frederik Träuble, Stefan Bauer, Bernhard Schölkopf, Francesco Locatello

Although reinforcement learning has seen remarkable progress over the last years, solving robust dexterous object-manipulation tasks in multi-object settings remains a challenge.

Object reinforcement-learning +2

Physical Derivatives: Computing policy gradients by physical forward-propagation

no code implementations • 15 Jan 2022 • Arash Mehrjou, Ashkan Soleymani, Stefan Bauer, Bernhard Schölkopf

Model-free and model-based reinforcement learning are two ends of a spectrum.

Model-based Reinforcement Learning

On the Adversarial Robustness of Causal Algorithmic Recourse

1 code implementation • 21 Dec 2021 • Ricardo Dominguez-Olmedo, Amir-Hossein Karimi, Bernhard Schölkopf

Algorithmic recourse seeks to provide actionable recommendations for individuals to overcome unfavorable classification outcomes from automated decision-making systems.

Adversarial Robustness Decision Making

Learning soft interventions in complex equilibrium systems

1 code implementation • 10 Dec 2021 • Michel Besserve, Bernhard Schölkopf

Complex systems often contain feedback loops that can be described as cyclic causal models.

Towards Principled Disentanglement for Domain Generalization

1 code implementation • CVPR 2022 • Hanlin Zhang, Yi-Fan Zhang, Weiyang Liu, Adrian Weller, Bernhard Schölkopf, Eric P. Xing

To tackle this challenge, we first formalize the OOD generalization problem as constrained optimization, called Disentanglement-constrained Domain Generalization (DDG).

Disentanglement Domain Generalization

Group equivariant neural posterior estimation

1 code implementation • ICLR 2022 • Maximilian Dax, Stephen R. Green, Jonathan Gair, Michael Deistler, Bernhard Schölkopf, Jakob H. Macke

We here describe an alternative method to incorporate equivariances under joint transformations of parameters and data.

Cause-effect inference through spectral independence in linear dynamical systems: theoretical foundations

no code implementations • 29 Oct 2021 • Michel Besserve, Naji Shajarisales, Dominik Janzing, Bernhard Schölkopf

A new perspective has been provided based on the principle of Independence of Causal Mechanisms (ICM), leading to the Spectral Independence Criterion (SIC), postulating that the power spectral density (PSD) of the cause time series is uncorrelated with the squared modulus of the frequency response of the filter generating the effect.

Causal Discovery Causal Inference +2

Resampling Base Distributions of Normalizing Flows

1 code implementation • 29 Oct 2021 • Vincent Stimper, Bernhard Schölkopf, José Miguel Hernández-Lobato

Normalizing flows are a popular class of models for approximating probability distributions.

Ranked #47 on Image Generation on CIFAR-10 (bits/dimension metric)

Density Estimation Image Generation

GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL

no code implementations • 29 Oct 2021 • Sumedh A Sontakke, Stephen Iota, Zizhao Hu, Arash Mehrjou, Laurent Itti, Bernhard Schölkopf

Extending the successes in supervised learning methods to the reinforcement learning (RL) setting, however, is difficult due to the data-generating process: RL agents actively query their environment for data, and the data are a function of the policy followed by the agent.

Out of Distribution (OOD) Detection Reinforcement Learning (RL)

Iterative Teaching by Label Synthesis

no code implementations • NeurIPS 2021 • Weiyang Liu, Zhen Liu, Hanchen Wang, Liam Paull, Bernhard Schölkopf, Adrian Weller

In this paper, we consider the problem of iterative machine teaching, where a teacher provides examples sequentially based on the current iterative learner.

Distributional Robustness Regularized Scenario Optimization with Application to Model Predictive Control

no code implementations • 26 Oct 2021 • Yassine Nemmour, Bernhard Schölkopf, Jia-Jie Zhu

We provide a functional view of distributional robustness motivated by robust statistics and functional analysis.

Model Predictive Control

Unsupervised Object Learning via Common Fate

1 code implementation • 13 Oct 2021 • Matthias Tangemann, Steffen Schneider, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Thomas Brox, Matthias Kümmerer, Matthias Bethge, Bernhard Schölkopf

Learning generative object models from unlabelled videos is a long-standing problem and is required for causal scene modeling.

Motion Segmentation Object

Dynamic Inference with Neural Interpreters

no code implementations • NeurIPS 2021 • Nasim Rahaman, Muhammad Waleed Gondal, Shruti Joshi, Peter Gehler, Yoshua Bengio, Francesco Locatello, Bernhard Schölkopf

Modern neural network architectures can leverage large amounts of data to generalize well within the training distribution.

Image Classification Systematic Generalization

Action-Sufficient State Representation Learning for Control with Structural Constraints

no code implementations • 12 Oct 2021 • Biwei Huang, Chaochao Lu, Liu Leqi, José Miguel Hernández-Lobato, Clark Glymour, Bernhard Schölkopf, Kun Zhang

Perceived signals in real-world scenarios are usually high-dimensional and noisy, and finding and using their representation that contains essential and sufficient information required by downstream decision-making tasks will help improve computational efficiency and generalization ability in the tasks.

Computational Efficiency Decision Making +1

You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction

no code implementations • ICLR 2022 • Osama Makansi, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Dominik Janzing, Thomas Brox, Bernhard Schölkopf

Applying this procedure to state-of-the-art trajectory prediction methods on standard benchmark datasets shows that they are, in fact, unable to reason about interactions.

Attribute Trajectory Prediction

Causal Direction of Data Collection Matters: Implications of Causal and Anticausal Learning for NLP

1 code implementation • EMNLP 2021 • Zhijing Jin, Julius von Kügelgen, Jingwei Ni, Tejas Vaidhya, Ayush Kaushal, Mrinmaya Sachan, Bernhard Schölkopf

The principle of independent causal mechanisms (ICM) states that generative processes of real world data consist of independent modules which do not influence or inform each other.

Causal Inference Domain Adaptation

Boxhead: A Dataset for Learning Hierarchical Representations

no code implementations • NeurIPS Workshop SVRHM 2021 • Yukun Chen, Andrea Dittadi, Frederik Träuble, Stefan Bauer, Bernhard Schölkopf

Disentanglement is hypothesized to be beneficial towards a number of downstream tasks.

Disentanglement

Spatial Context Awareness for Unsupervised Change Detection in Optical Satellite Images

no code implementations • 5 Oct 2021 • Lukas Kondmann, Aysim Toker, Sudipan Saha, Bernhard Schölkopf, Laura Leal-Taixé, Xiao Xiang Zhu

It uses this model to analyze differences in the pixel and its spatial context-based predictions in subsequent time periods for change detection.

Change Detection Earth Observation

On the interventional consistency of autoencoders

no code implementations • 29 Sep 2021 • Giulia Lanzillotta, Felix Leeb, Stefan Bauer, Bernhard Schölkopf

Autoencoders have played a crucial role in the field of representation learning since its inception, proving to be a flexible learning scheme able to accommodate various notions of optimality of the representation.

Disentanglement

Invariant Causal Representation Learning for Out-of-Distribution Generalization

no code implementations • ICLR 2022 • Chaochao Lu, Yuhuai Wu, José Miguel Hernández-Lobato, Bernhard Schölkopf

Extensive experiments on both synthetic and real-world datasets show that our approach outperforms a variety of baseline methods.

Out-of-Distribution Generalization Representation Learning

Direct Advantage Estimation

1 code implementation • 13 Sep 2021 • Hsiao-Ru Pan, Nico Gürtler, Alexander Neitz, Bernhard Schölkopf

The predominant approach in reinforcement learning is to assign credit to actions based on the expected return.

Learning Neural Causal Models with Active Interventions

1 code implementation • 6 Sep 2021 • Nino Scherrer, Olexa Bilaniuk, Yashas Annadani, Anirudh Goyal, Patrick Schwab, Bernhard Schölkopf, Michael C. Mozer, Yoshua Bengio, Stefan Bauer, Nan Rosemary Ke

Discovering causal structures from data is a challenging inference problem of fundamental importance in all areas of science.

Causal Discovery

Visual Representation Learning Does Not Generalize Strongly Within the Same Domain

1 code implementation • ICLR 2022 • Lukas Schott, Julius von Kügelgen, Frederik Träuble, Peter Gehler, Chris Russell, Matthias Bethge, Bernhard Schölkopf, Francesco Locatello, Wieland Brendel

An important component for generalization in machine learning is to uncover underlying latent factors of variation as well as the mechanism through which each factor acts in the world.

Representation Learning

The Role of Pretrained Representations for the OOD Generalization of Reinforcement Learning Agents

no code implementations • ICLR 2022 • Andrea Dittadi, Frederik Träuble, Manuel Wüthrich, Felix Widmaier, Peter Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, Stefan Bauer

By training 240 representations and over 10,000 reinforcement learning (RL) policies on a simulated robotic setup, we evaluate to what extent different properties of pretrained VAE-based representations affect the OOD generalization of downstream agents.

Reinforcement Learning (RL) Representation Learning

Source-Free Adaptation to Measurement Shift via Bottom-Up Feature Restoration

1 code implementation • ICLR 2022 • Cian Eastwood, Ian Mason, Christopher K. I. Williams, Bernhard Schölkopf

Existing methods for SFDA leverage entropy-minimization techniques which: (i) apply only to classification; (ii) destroy model calibration; and (iii) rely on the source model achieving a good level of feature-space class-separation in the target domain.

Source-Free Domain Adaptation

Backward-Compatible Prediction Updates: A Probabilistic Approach

no code implementations • NeurIPS 2021 • Frederik Träuble, Julius von Kügelgen, Matthäus Kleindessner, Francesco Locatello, Bernhard Schölkopf, Peter Gehler

; and (ii) if the new predictions differ from the current ones, should we update?

Generalization and Robustness Implications in Object-Centric Learning

1 code implementation • 1 Jul 2021 • Andrea Dittadi, Samuele Papa, Michele De Vita, Bernhard Schölkopf, Ole Winther, Francesco Locatello

The idea behind object-centric representation learning is that natural scenes can better be modeled as compositions of objects and their relations as opposed to distributed representations.

Inductive Bias Object +3

Exploring the Latent Space of Autoencoders with Interventional Assays

1 code implementation • 30 Jun 2021 • Felix Leeb, Stefan Bauer, Michel Besserve, Bernhard Schölkopf

Autoencoders exhibit impressive abilities to embed the data manifold into a low-dimensional latent space, making them a staple of representation learning methods.

Disentanglement

Shallow Representation is Deep: Learning Uncertainty-aware and Worst-case Random Feature Dynamics

no code implementations • 24 Jun 2021 • Diego Agudelo-España, Yassine Nemmour, Bernhard Schölkopf, Jia-Jie Zhu

Random features is a powerful universal function approximator that inherits the theoretical rigor of kernel methods and can scale up to modern learning tasks.

Real-time gravitational-wave science with neural posterior estimation

1 code implementation • 23 Jun 2021 • Maximilian Dax, Stephen R. Green, Jonathan Gair, Jakob H. Macke, Alessandra Buonanno, Bernhard Schölkopf

We demonstrate unprecedented accuracy for rapid gravitational-wave parameter estimation with deep learning.

Algorithmic Recourse in Partially and Fully Confounded Settings Through Bounding Counterfactual Effects

no code implementations • 22 Jun 2021 • Julius von Kügelgen, Nikita Agarwal, Jakob Zeitler, Afsaneh Mastouri, Bernhard Schölkopf

Algorithmic recourse aims to provide actionable recommendations to individuals to obtain a more favourable outcome from an automated decision-making system.

counterfactual Decision Making

Towards Total Recall in Industrial Anomaly Detection

18 code implementations • CVPR 2022 • Karsten Roth, Latha Pemula, Joaquin Zepeda, Bernhard Schölkopf, Thomas Brox, Peter Gehler

Being able to spot defective parts is a critical component in large-scale industrial manufacturing.

Ranked #3 on Anomaly Detection on AeBAD-V

Outlier Detection Unsupervised Anomaly Detection

Representation Learning for Out-of-distribution Generalization in Reinforcement Learning

no code implementations • ICML Workshop URL 2021 • Frederik Träuble, Andrea Dittadi, Manuel Wuthrich, Felix Widmaier, Peter Vincent Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, Stefan Bauer

Learning data representations that are useful for various downstream tasks is a cornerstone of artificial intelligence.

Out-of-Distribution Generalization reinforcement-learning +2

CausalAdv: Adversarial Robustness through the Lens of Causality

1 code implementation • ICLR 2022 • Yonggang Zhang, Mingming Gong, Tongliang Liu, Gang Niu, Xinmei Tian, Bo Han, Bernhard Schölkopf, Kun Zhang

The adversarial vulnerability of deep neural networks has attracted significant attention in machine learning.

Adversarial Attack Adversarial Robustness

Independent mechanism analysis, a new concept?

1 code implementation • NeurIPS 2021 • Luigi Gresele, Julius von Kügelgen, Vincent Stimper, Bernhard Schölkopf, Michel Besserve

Specifically, our approach is motivated by thinking of each source as independently influencing the mixing process.

blind source separation Representation Learning

Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

1 code implementation • NeurIPS 2021 • Julius von Kügelgen, Yash Sharma, Luigi Gresele, Wieland Brendel, Bernhard Schölkopf, Michel Besserve, Francesco Locatello

A common practice is to perform data augmentation via hand-crafted transformations intended to leave the semantics of the data invariant.

Ranked #1 on Image Classification on Causal3DIdent

Data Augmentation Disentanglement +2

The Inductive Bias of Quantum Kernels

1 code implementation • NeurIPS 2021 • Jonas M. Kübler, Simon Buchholz, Bernhard Schölkopf

Quantum computers offer the possibility to efficiently compute inner products of exponentially large density operators that are classically hard to compute.

Inductive Bias Quantum Machine Learning

Causal Influence Detection for Improving Efficiency in Reinforcement Learning

2 code implementations • NeurIPS 2021 • Maximilian Seitzer, Bernhard Schölkopf, Georg Martius

Many reinforcement learning (RL) environments consist of independent entities that interact sparsely.

reinforcement-learning Reinforcement Learning (RL)

Instrument Space Selection for Kernel Maximum Moment Restriction

1 code implementation • 7 Jun 2021 • Rui Zhang, Krikamol Muandet, Bernhard Schölkopf, Masaaki Imaizumi

Kernel maximum moment restriction (KMMR) has recently emerged as a popular framework for instrumental variable (IV) based conditional moment restriction (CMR) models with important applications in conditional moment (CM) testing and parameter estimation for IV regression and proximal causal learning.

Diffusion-Based Representation Learning

no code implementations • 29 May 2021 • Korbinian Abstreiter, Sarthak Mittal, Stefan Bauer, Bernhard Schölkopf, Arash Mehrjou

In contrast, the introduced diffusion-based representation learning relies on a new formulation of the denoising score matching objective and thus encodes the information needed for denoising.

Denoising Representation Learning +1

DiBS: Differentiable Bayesian Structure Learning

2 code implementations • NeurIPS 2021 • Lars Lorch, Jonas Rothfuss, Bernhard Schölkopf, Andreas Krause

In this work, we propose a general, fully differentiable framework for Bayesian structure learning (DiBS) that operates in the continuous space of a latent probabilistic graph representation.

Causal Discovery Variational Inference

Fast and Slow Learning of Recurrent Independent Mechanisms

no code implementations • 18 May 2021 • Kanika Madan, Nan Rosemary Ke, Anirudh Goyal, Bernhard Schölkopf, Yoshua Bengio

To study these ideas, we propose a particular training framework in which we assume that the pieces of knowledge an agent needs and its reward function are stationary and can be re-used across tasks.

Meta-Learning

Regret Bounds for Gaussian-Process Optimization in Large Domains

1 code implementation • NeurIPS 2021 • Manuel Wüthrich, Bernhard Schölkopf, Andreas Krause

These regret bounds illuminate the relationship between the number of evaluations, the domain size (i.e., cardinality of finite domains / Lipschitz constant of the covariance function in continuous domains), and the optimality of the retrieved function value.

Pyfectious: An individual-level simulator to discover optimal containment polices for epidemic diseases

1 code implementation • 24 Mar 2021 • Arash Mehrjou, Ashkan Soleymani, Amin Abyaneh, Samir Bhatt, Bernhard Schölkopf, Stefan Bauer

Simulating the spread of infectious diseases in human communities is critical for predicting the trajectory of an epidemic and verifying various policies to control the devastating impacts of the outbreak.

A prior-based approximate latent Riemannian metric

no code implementations • 9 Mar 2021 • Georgios Arvanitidis, Bogdan Georgiev, Bernhard Schölkopf

In this work we propose a surrogate conformal Riemannian metric in the latent space of a generative model that is simple, efficient and robust.

Learning with Hyperspherical Uniformity

1 code implementation • 2 Mar 2021 • Weiyang Liu, Rongmei Lin, Zhen Liu, Li Xiong, Bernhard Schölkopf, Adrian Weller

Owing to their over-parameterized nature, neural networks are a powerful tool for nonlinear function approximation.

Inductive Bias L2 Regularization

Nonlinear Invariant Risk Minimization: A Causal Approach

no code implementations • 24 Feb 2021 • Chaochao Lu, Yuhuai Wu, José Miguel Hernández-Lobato, Bernhard Schölkopf

Finally, in the discussion, we further explore the aforementioned assumption and propose a more general hypothesis, called the Agnostic Hypothesis: there exists a set of hidden causal factors affecting both inputs and outcomes.

BIG-bench Machine Learning Representation Learning

Finding Stable Matchings in PhD Markets with Consistent Preferences and Cooperative Partners

no code implementations • 23 Feb 2021 • Maximilian Mordig, Riccardo Della Vecchia, Nicolò Cesa-Bianchi, Bernhard Schölkopf

Our setting is motivated by a PhD market of students, advisors, and co-advisors, and can be generalized to supply chain networks viewed as $n$-sided markets.

Computer Science and Game Theory Theoretical Economics Combinatorics

Towards Causal Representation Learning

no code implementations • 22 Feb 2021 • Bernhard Schölkopf, Francesco Locatello, Stefan Bauer, Nan Rosemary Ke, Nal Kalchbrenner, Anirudh Goyal, Yoshua Bengio

The two fields of machine learning and graphical causality arose and developed separately.

BIG-bench Machine Learning Causal Inference +1

Adversarially Robust Kernel Smoothing

1 code implementation • 16 Feb 2021 • Jia-Jie Zhu, Christina Kouridi, Yassine Nemmour, Bernhard Schölkopf

We propose a scalable robust learning algorithm combining kernel smoothing and robust optimization.

BIG-bench Machine Learning

Conditional Distributional Treatment Effect with Kernel Conditional Mean Embeddings and U-Statistic Regression

no code implementations • 16 Feb 2021 • Junhyung Park, Uri Shalit, Bernhard Schölkopf, Krikamol Muandet

We propose to analyse the conditional distributional treatment effect (CoDiTE), which, in contrast to the more common conditional average treatment effect (CATE), is designed to encode a treatment's distributional aspects beyond the mean.

regression

Bayesian Quadrature on Riemannian Data Manifolds

1 code implementation • 12 Feb 2021 • Christian Fröhlich, Alexandra Gessner, Philipp Hennig, Bernhard Schölkopf, Georgios Arvanitidis

Riemannian manifolds provide a principled way to model nonlinear geometric structure inherent in data.

Paper
Code

A Witness Two-Sample Test

1 code implementation • 10 Feb 2021 • Jonas M. Kübler, Wittawat Jitkrittum, Bernhard Schölkopf, Krikamol Muandet

That is, the test set is used to simultaneously estimate the expectations and define the basis points, while the training set only serves to select the kernel and is discarded.

Two-sample testing Vocal Bursts Valence Prediction

Paper
Code

Learning to interpret trajectories

no code implementations • ICLR 2021 • Alexander Neitz, Giambattista Parascandolo, Bernhard Schölkopf

By learning to predict trajectories of dynamical systems, model-based methods can make extensive use of all observations from past experience.

Paper
Add Code

Learned residual Gerchberg-Saxton network for computer generated holography

no code implementations • 1 Jan 2021 • Lennart Schlieder, Heiner Kremer, Valentin Volchkov, Kai Melde, Peer Fischer, Bernhard Schölkopf

Instead of an iterative optimization algorithm that converges to a (sub-)optimal solution, the inverse problem can be solved by training a neural network to directly estimate the inverse operator.

Paper
Add Code

Invariant Causal Representation Learning

no code implementations • 1 Jan 2021 • Chaochao Lu, Yuhuai Wu, José Miguel Hernández-Lobato, Bernhard Schölkopf

As an alternative, we propose Invariant Causal Representation Learning (ICRL), a learning paradigm that enables out-of-distribution generalization in the nonlinear setting (i.e., nonlinear representations and nonlinear classifiers).

Out-of-Distribution Generalization Representation Learning

Paper
Add Code

Spatially Structured Recurrent Modules

no code implementations • ICLR 2021 • Nasim Rahaman, Anirudh Goyal, Muhammad Waleed Gondal, Manuel Wuthrich, Stefan Bauer, Yash Sharma, Yoshua Bengio, Bernhard Schölkopf

Capturing the structure of a data-generating process by means of appropriate inductive biases can help in learning models that generalise well and are robust to changes in the input distribution.

Starcraft II Video Prediction

Paper
Add Code

Meta Attention Networks: Meta-Learning Attention to Modulate Information Between Recurrent Independent Mechanisms

no code implementations • ICLR 2021 • Kanika Madan, Nan Rosemary Ke, Anirudh Goyal, Bernhard Schölkopf, Yoshua Bengio

Decomposing knowledge into interchangeable pieces promises a generalization advantage when there are changes in distribution.

Meta-Learning

Paper
Add Code

Dependency Structure Discovery from Interventions

no code implementations • 1 Jan 2021 • Nan Rosemary Ke, Olexa Bilaniuk, Anirudh Goyal, Stefan Bauer, Bernhard Schölkopf, Michael Curtis Mozer, Hugo Larochelle, Christopher Pal, Yoshua Bengio

Promising results have driven a recent surge of interest in continuous optimization methods for Bayesian network structure learning from observational data.

Paper
Add Code

Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation

no code implementations • 16 Dec 2020 • Chaochao Lu, Biwei Huang, Ke Wang, José Miguel Hernández-Lobato, Kun Zhang, Bernhard Schölkopf

We propose counterfactual RL algorithms to learn both population-level and individual-level policies.

counterfactual Data Augmentation +3

Paper
Add Code

Assaying Large-scale Testing Models to Interpret COVID-19 Case Numbers

no code implementations • 3 Dec 2020 • Michel Besserve, Simon Buchholz, Bernhard Schölkopf

Large-scale testing is considered key to assessing the state of the current COVID-19 pandemic.

Applications Populations and Evolution

Paper
Add Code

Causal analysis of Covid-19 Spread in Germany

no code implementations • NeurIPS 2020 • Atalanti Mastakouri, Bernhard Schölkopf

In this work, we study the causal relations among German regions in terms of the spread of Covid-19 since the beginning of the pandemic, taking into account the restriction policies that were applied by the different federal states.

feature selection Time Series +1

Paper
Add Code

COVI-AgentSim: an Agent-based Model for Evaluating Methods of Digital Contact Tracing

no code implementations • 30 Oct 2020 • Prateek Gupta, Tegan Maharaj, Martin Weiss, Nasim Rahaman, Hannah Alsdurf, Abhinav Sharma, Nanor Minoyan, Soren Harnois-Leblanc, Victor Schmidt, Pierre-Luc St. Charles, Tristan Deleu, Andrew Williams, Akshay Patel, Meng Qu, Olexa Bilaniuk, Gaétan Marceau Caron, Pierre Luc Carrier, Satya Ortiz-Gagné, Marc-Andre Rousseau, David Buckeridge, Joumana Ghosn, Yang Zhang, Bernhard Schölkopf, Jian Tang, Irina Rish, Christopher Pal, Joanna Merckx, Eilif B. Muller, Yoshua Bengio

The rapid global spread of COVID-19 has led to an unprecedented demand for effective methods to mitigate the spread of the disease, and various digital contact tracing (DCT) methods have emerged as a component of the solution.

Virology

Paper
Add Code

On the Transfer of Disentangled Representations in Realistic Settings

no code implementations • ICLR 2021 • Andrea Dittadi, Frederik Träuble, Francesco Locatello, Manuel Wüthrich, Vaibhav Agrawal, Ole Winther, Stefan Bauer, Bernhard Schölkopf

Learning meaningful representations that disentangle the underlying structure of the data generating process is considered to be of key importance in machine learning.

Disentanglement

Paper
Add Code

A Sober Look at the Unsupervised Learning of Disentangled Representations and their Evaluation

no code implementations • 27 Oct 2020 • Francesco Locatello, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, Olivier Bachem

The idea behind the unsupervised learning of disentangled representations is that real-world data is generated by a few explanatory factors of variation which can be recovered by unsupervised learning algorithms.

Disentanglement

Paper
Add Code

Predicting Infectiousness for Proactive Contact Tracing

1 code implementation • ICLR 2021 • Yoshua Bengio, Prateek Gupta, Tegan Maharaj, Nasim Rahaman, Martin Weiss, Tristan Deleu, Eilif Muller, Meng Qu, Victor Schmidt, Pierre-Luc St-Charles, Hannah Alsdurf, Olexa Bilaniuk, David Buckeridge, Gaétan Marceau Caron, Pierre-Luc Carrier, Joumana Ghosn, Satya Ortiz-Gagne, Chris Pal, Irina Rish, Bernhard Schölkopf, Abhinav Sharma, Jian Tang, Andrew Williams

Predictions are used to provide personalized recommendations to the individual via an app, as well as to send anonymized messages to the individual's contacts, who use this information to better predict their own infectiousness, an approach we call proactive contact tracing (PCT).

Paper
Code

Instrumental Variable Regression via Kernel Maximum Moment Loss

2 code implementations • 15 Oct 2020 • Rui Zhang, Masaaki Imaizumi, Bernhard Schölkopf, Krikamol Muandet

We investigate a simple objective for nonlinear instrumental variable (IV) regression based on a kernelized conditional moment restriction (CMR) known as a maximum moment restriction (MMR).

regression

Paper
Code

Function Contrastive Learning of Transferable Meta-Representations

no code implementations • 14 Oct 2020 • Muhammad Waleed Gondal, Shruti Joshi, Nasim Rahaman, Stefan Bauer, Manuel Wüthrich, Bernhard Schölkopf

This meta-representation, which is computed from a few observed examples of the underlying function, is learned jointly with the predictive model.

Contrastive Learning Few-Shot Learning

Paper
Add Code

On the Fairness of Causal Algorithmic Recourse

1 code implementation • 13 Oct 2020 • Julius von Kügelgen, Amir-Hossein Karimi, Umang Bhatt, Isabel Valera, Adrian Weller, Bernhard Schölkopf

Algorithmic fairness is typically studied from the perspective of predictions.

counterfactual Fairness

Paper
Code

Physically constrained causal noise models for high-contrast imaging of exoplanets

no code implementations • 12 Oct 2020 • Timothy D. Gebhard, Markus J. Bonse, Sascha P. Quanz, Bernhard Schölkopf

The detection of exoplanets in high-contrast imaging (HCI) data hinges on post-processing methods to remove spurious light from the host star.

Vocal Bursts Intensity Prediction

Paper
Add Code

CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

1 code implementation • ICLR 2021 • Ossama Ahmed, Frederik Träuble, Anirudh Goyal, Alexander Neitz, Yoshua Bengio, Bernhard Schölkopf, Manuel Wüthrich, Stefan Bauer

To facilitate research addressing this problem, we propose CausalWorld, a benchmark for causal structure and transfer learning in a robotic manipulation environment.

Reinforcement Learning (RL) Transfer Learning

Paper
Code

A survey of algorithmic recourse: definitions, formulations, solutions, and prospects

no code implementations • 8 Oct 2020 • Amir-Hossein Karimi, Gilles Barthe, Bernhard Schölkopf, Isabel Valera

Machine learning is increasingly used to inform decision-making in sensitive situations where decisions have consequential effects on individuals' lives.

Decision Making Fairness

Paper
Add Code

Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning

1 code implementation • 7 Oct 2020 • Sumedh A. Sontakke, Arash Mehrjou, Laurent Itti, Bernhard Schölkopf

Inspired by this, we attempt to equip reinforcement learning agents with the ability to perform experiments that facilitate a categorization of the rolled-out trajectories, and to subsequently infer the causal factors of the environment in a hierarchical manner.

Representation Learning Zero-Shot Learning

Paper
Code

Function Contrastive Learning of Transferable Representations

no code implementations • 28 Sep 2020 • Muhammad Waleed Gondal, Shruti Joshi, Nasim Rahaman, Stefan Bauer, Manuel Wuthrich, Bernhard Schölkopf

Few-shot learning seeks to find models that are capable of fast adaptation to novel tasks which are not encountered during training.

Contrastive Learning Few-Shot Learning

Paper
Add Code

Learning explanations that are hard to vary

3 code implementations • ICLR 2021 • Giambattista Parascandolo, Alexander Neitz, Antonio Orvieto, Luigi Gresele, Bernhard Schölkopf

In this paper, we investigate the principle that 'good explanations are hard to vary' in the context of deep learning.

Memorization

Paper
Code

Real-time Prediction of COVID-19 related Mortality using Electronic Health Records

no code implementations • 31 Aug 2020 • Patrick Schwab, Arash Mehrjou, Sonali Parbhoo, Leo Anthony Celi, Jürgen Hetzel, Markus Hofer, Bernhard Schölkopf, Stefan Bauer

Coronavirus Disease 2019 (COVID-19) is an emerging respiratory disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) with rapid human-to-human transmission and a high case fatality rate particularly in older patients.

Specificity

Paper
Add Code

Learning Dynamical Systems using Local Stability Priors

no code implementations • 23 Aug 2020 • Arash Mehrjou, Andrea Iannelli, Bernhard Schölkopf

A coupled computational approach to simultaneously learn a vector field and the region of attraction of an equilibrium point from generated trajectories of the system is proposed.

Paper
Add Code

TriFinger: An Open-Source Robot for Learning Dexterity

2 code implementations • 8 Aug 2020 • Manuel Wüthrich, Felix Widmaier, Felix Grimminger, Joel Akpo, Shruti Joshi, Vaibhav Agrawal, Bilal Hammoud, Majid Khadiv, Miroslav Bogdanovic, Vincent Berenz, Julian Viereck, Maximilien Naveau, Ludovic Righetti, Bernhard Schölkopf, Stefan Bauer

Dexterous object manipulation remains an open problem in robotics, despite the rapid progress in machine learning during the past decade.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Geometrically Enriched Latent Spaces

no code implementations • 2 Aug 2020 • Georgios Arvanitidis, Søren Hauberg, Bernhard Schölkopf

A common assumption in generative models is that the generator immerses the latent space into a Euclidean ambient space.

Paper
Add Code

A Commentary on the Unsupervised Learning of Disentangled Representations

no code implementations • 28 Jul 2020 • Francesco Locatello, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, Olivier Bachem

The goal of the unsupervised learning of disentangled representations is to separate the independent explanatory factors of variation in the data without access to supervision.

Paper
Add Code

S2RMs: Spatially Structured Recurrent Modules

no code implementations • 13 Jul 2020 • Nasim Rahaman, Anirudh Goyal, Muhammad Waleed Gondal, Manuel Wuthrich, Stefan Bauer, Yash Sharma, Yoshua Bengio, Bernhard Schölkopf

Capturing the structure of a data-generating process by means of appropriate inductive biases can help in learning models that generalize well and are robust to changes in the input distribution.

Starcraft II Video Prediction

Paper
Add Code

Causal Feature Selection via Orthogonal Search

no code implementations • 6 Jul 2020 • Ashkan Soleymani, Anant Raj, Stefan Bauer, Bernhard Schölkopf, Michel Besserve

The problem of inferring the direct causal parents of a response variable among a large set of explanatory variables is of high practical importance in many disciplines.

Causal Discovery feature selection

Paper
Add Code

Relative gradient optimization of the Jacobian term in unsupervised deep learning

1 code implementation • NeurIPS 2020 • Luigi Gresele, Giancarlo Fissore, Adrián Javaloy, Bernhard Schölkopf, Aapo Hyvärinen

Learning expressive probabilistic models correctly describing the data is a ubiquitous problem in machine learning.

Paper
Code

Metrizing Weak Convergence with Maximum Mean Discrepancies

no code implementations • 16 Jun 2020 • Carl-Johann Simon-Gabriel, Alessandro Barp, Bernhard Schölkopf, Lester Mackey

More precisely, we prove that, on a locally compact, non-compact, Hausdorff space, the MMD of a bounded continuous Borel measurable kernel k, whose reproducing kernel Hilbert space (RKHS) functions vanish at infinity, metrizes the weak convergence of probability measures if and only if k is continuous and integrally strictly positive definite (i.s.p.d.).

Paper
Add Code

On Disentangled Representations Learned From Correlated Data

2 code implementations • 14 Jun 2020 • Frederik Träuble, Elliot Creager, Niki Kilbertus, Francesco Locatello, Andrea Dittadi, Anirudh Goyal, Bernhard Schölkopf, Stefan Bauer

The focus of disentanglement approaches has been on identifying independent factors of variation in data.

Disentanglement Fairness

Paper
Code

Structure by Architecture: Structured Representations without Regularization

no code implementations • 14 Jun 2020 • Felix Leeb, Guilia Lanzillotta, Yashas Annadani, Michel Besserve, Stefan Bauer, Bernhard Schölkopf

We study the problem of self-supervised structured representation learning using autoencoders for downstream tasks such as generative modeling.

Disentanglement

Paper
Add Code

Kernel Distributionally Robust Optimization

2 code implementations • 12 Jun 2020 • Jia-Jie Zhu, Wittawat Jitkrittum, Moritz Diehl, Bernhard Schölkopf

We prove a theorem that generalizes the classical duality in the mathematical problem of moments.

Stochastic Optimization

Paper
Code

Algorithmic recourse under imperfect causal knowledge: a probabilistic approach

1 code implementation • NeurIPS 2020 • Amir-Hossein Karimi, Julius von Kügelgen, Bernhard Schölkopf, Isabel Valera

Recent work has discussed the limitations of counterfactual explanations for recommending actions for algorithmic recourse, and argued for the need to take causal relationships between features into consideration.

counterfactual

Paper
Code

Learning to Play Table Tennis From Scratch using Muscular Robots

no code implementations • 10 Jun 2020 • Dieter Büchler, Simon Guist, Roberto Calandra, Vincent Berenz, Bernhard Schölkopf, Jan Peters

This work is the first to (a) learn a safety-critical dynamic task in a fail-safe manner using anthropomorphic robot arms, (b) learn a precision-demanding problem with a PAM-driven system despite the control challenges, and (c) train robots to play table tennis without real balls.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Neural Lyapunov Redesign

1 code implementation • 6 Jun 2020 • Arash Mehrjou, Mohammad Ghavamzadeh, Bernhard Schölkopf

We provide theoretical results on the class of systems that can be treated with the proposed algorithm and empirically evaluate the effectiveness of our method using an exemplary dynamical system.

Paper
Code

Learning Kernel Tests Without Data Splitting

1 code implementation • NeurIPS 2020 • Jonas M. Kübler, Wittawat Jitkrittum, Bernhard Schölkopf, Krikamol Muandet

Modern large-scale kernel-based tests such as maximum mean discrepancy (MMD) and kernelized Stein discrepancy (KSD) optimize kernel hyperparameters on a held-out sample via data splitting to obtain the most powerful test statistics.

Paper
Code

A machine learning route between band mapping and band structure

1 code implementation • 20 May 2020 • Rui Patrick Xian, Vincent Stimper, Marios Zacharias, Shuo Dong, Maciej Dendzik, Samuel Beaulieu, Bernhard Schölkopf, Martin Wolf, Laurenz Rettig, Christian Carbogno, Stefan Bauer, Ralph Ernstorfer

Electronic band structure (BS) and crystal structure are the two complementary identifiers of solid state materials.

Data Analysis, Statistics and Probability Materials Science Computational Physics

Paper
Code

Necessary and sufficient conditions for causal feature selection in time series with latent common causes

no code implementations • 18 May 2020 • Atalanti A. Mastakouri, Bernhard Schölkopf, Dominik Janzing

We study the identification of direct and indirect causes on time series and provide conditions in the presence of latent variables, which we prove to be necessary and sufficient under some graph constraints.

feature selection Time Series +1

Paper
Add Code

Simpson's paradox in Covid-19 case fatality rates: a mediation analysis of age-related causal effects

1 code implementation • 14 May 2020 • Julius von Kügelgen, Luigi Gresele, Bernhard Schölkopf

We point out limitations and extensions for future work, and, finally, discuss the role of causal reasoning in the broader context of using AI to combat the Covid-19 pandemic.

Applications Methodology

Paper
Code

Crackovid: Optimizing Group Testing

no code implementations • 13 May 2020 • Louis Abraham, Gary Bécigneul, Bernhard Schölkopf

We study the problem usually referred to as group testing in the context of COVID-19.

Paper
Add Code

Disentangling Factors of Variations Using Few Labels

no code implementations • ICLR Workshop LLD 2019 • Francesco Locatello, Michael Tschannen, Stefan Bauer, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem

Recently, Locatello et al. (2019) demonstrated that unsupervised disentanglement learning without inductive biases is theoretically impossible and that existing inductive biases and unsupervised methods do not allow disentangled representations to be learned consistently.

Disentanglement Model Selection

Paper
Add Code
