Search Results for author: Mattie Fellows

Found 3 papers, 1 papers with code

Refining Minimax Regret for Unsupervised Environment Design

1 code implementation • 19 Feb 2024 • Michael Beukman, Samuel Coward, Michael Matthews, Mattie Fellows, Minqi Jiang, Michael Dennis, Jakob Foerster

In this work, we introduce Bayesian level-perfect MMR (BLP), a refinement of the minimax regret objective that overcomes this limitation.

Paper
Code

Bayesian Exploration Networks

no code implementations • 24 Aug 2023 • Mattie Fellows, Brandon Kaplowitz, Christian Schroeder de Witt, Shimon Whiteson

Empirical results demonstrate that BEN can learn true Bayes-optimal policies in tasks where existing model-free approaches fail.

Decision Making Decision Making Under Uncertainty +4

Paper
Add Code

Why Target Networks Stabilise Temporal Difference Methods

no code implementations • 24 Feb 2023 • Mattie Fellows, Matthew J. A. Smith, Shimon Whiteson

Integral to recent successes in deep reinforcement learning has been a class of temporal difference methods that use infrequently updated target values for policy evaluation in a Markov Decision Process.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.