Search Results for author: Mattie Fellows

Found 3 papers, 1 paper with code

Refining Minimax Regret for Unsupervised Environment Design

1 code implementation • 19 Feb 2024 • Michael Beukman, Samuel Coward, Michael Matthews, Mattie Fellows, Minqi Jiang, Michael Dennis, Jakob Foerster

In this work, we introduce Bayesian level-perfect MMR (BLP), a refinement of the minimax regret objective that overcomes this limitation.

Bayesian Exploration Networks

no code implementations • 24 Aug 2023 • Mattie Fellows, Brandon Kaplowitz, Christian Schroeder de Witt, Shimon Whiteson

Empirical results demonstrate that BEN can learn true Bayes-optimal policies in tasks where existing model-free approaches fail.

Decision Making • Decision Making Under Uncertainty • +4

Why Target Networks Stabilise Temporal Difference Methods

no code implementations • 24 Feb 2023 • Mattie Fellows, Matthew J. A. Smith, Shimon Whiteson

Integral to recent successes in deep reinforcement learning has been a class of temporal difference methods that use infrequently updated target values for policy evaluation in a Markov Decision Process.
