no code implementations • 2 Apr 2024 • Golnaz Mesbahi, Olya Mastikhina, Parham Mohammad Panahi, Martha White, Adam White
In this paper, we propose a new approach for tuning and evaluating lifelong RL agents in which only one percent of the experiment data can be used for hyperparameter tuning.
no code implementations • 6 Jun 2022 • Chunlok Lo, Kevin Roice, Parham Mohammad Panahi, Scott Jordan, Adam White, Gabor Mihucz, Farzane Aminmansour, Martha White
In this paper, we avoid this limitation by constraining background planning to a set of (abstract) subgoals and learning only local, subgoal-conditioned models.
Model-based Reinforcement Learning • Reinforcement Learning (RL)