Search Results for author: Ethan Waldie

Found 2 papers, 1 papers with code

Learning Reward Machines: A Study in Partially Observable Reinforcement Learning

no code implementations17 Dec 2021 Rodrigo Toro Icarte, Ethan Waldie, Toryn Q. Klassen, Richard Valenzano, Margarita P. Castro, Sheila A. McIlraith

Here we show that RMs can be learned from experience, instead of being specified by the user, and that the resulting problem decomposition can be used to effectively solve partially observable RL problems.

Partially Observable Reinforcement Learning Problem Decomposition +2

Learning Reward Machines for Partially Observable Reinforcement Learning

1 code implementation NeurIPS 2019 Rodrigo Toro Icarte, Ethan Waldie, Toryn Klassen, Rick Valenzano, Margarita Castro, Sheila Mcilraith

Reward Machines (RMs), originally proposed for specifying problems in Reinforcement Learning (RL), provide a structured, automata-based representation of a reward function that allows an agent to decompose problems into subproblems that can be efficiently learned using off-policy learning.

Partially Observable Reinforcement Learning Problem Decomposition +2

Cannot find the paper you are looking for? You can Submit a new open access paper.