no code implementations • 26 Aug 2020 • Alan Chan, Kris de Asis, Richard S. Sutton
In this work, we explore the use of \textit{inverse policy evaluation}, the process of solving for a likely policy given a value function, for deriving behavior from a value function.