Search Results for author: Arnaud Fickinger

Found 6 papers, 4 papers with code

Cross-Domain Imitation Learning via Optimal Transport

1 code implementation • ICLR 2022 • Arnaud Fickinger, samuel cohen, Stuart Russell, Brandon Amos

Cross-domain imitation learning studies how to leverage expert demonstrations of one agent to train an imitation agent with a different embodiment or morphology.

Continuous Control Imitation Learning

Paper
Code

Scalable Online Planning via Reinforcement Learning Fine-Tuning

1 code implementation • NeurIPS 2021 • Arnaud Fickinger, Hengyuan Hu, Brandon Amos, Stuart Russell, Noam Brown

Lookahead search has been a critical component of recent AI successes, such as in the games of chess, go, and poker.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Explore and Control with Adversarial Surprise

1 code implementation • ICML Workshop URL 2021 • Arnaud Fickinger, Natasha Jaques, Samyak Parajuli, Michael Chang, Nicholas Rhinehart, Glen Berseth, Stuart Russell, Sergey Levine

Unsupervised reinforcement learning (RL) studies how to leverage environment statistics to learn useful behaviors without the cost of reward engineering.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

Multi-Principal Assistance Games: Definition and Collegial Mechanisms

no code implementations • 29 Dec 2020 • Arnaud Fickinger, Simon Zhuang, Andrew Critch, Dylan Hadfield-Menell, Stuart Russell

We introduce the concept of a multi-principal assistance game (MPAG), and circumvent an obstacle in social choice theory, Gibbard's theorem, by using a sufficiently collegial preference inference mechanism.

Paper
Add Code

Multi-Principal Assistance Games

no code implementations • 19 Jul 2020 • Arnaud Fickinger, Simon Zhuang, Dylan Hadfield-Menell, Stuart Russell

Assistance games (also known as cooperative inverse reinforcement learning games) have been proposed as a model for beneficial AI, wherein a robotic agent must act on behalf of a human principal but is initially uncertain about the humans payoff function.

Paper
Add Code

Biadversarial Variational Autoencoder

1 code implementation • 9 Feb 2019 • Arnaud Fickinger

In the original version of the Variational Autoencoder, Kingma et al. assume Gaussian distributions for the approximate posterior during the inference and for the output during the generative process.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.