1 code implementation • NeurIPS 2021 • Cameron Allen, Neev Parikh, Omer Gottesman, George Konidaris
A fundamental assumption of reinforcement learning in Markov decision processes (MDPs) is that the relevant decision process is, in fact, Markov.
no code implementations • 5 Feb 2020 • Kavosh Asadi, Neev Parikh, Ronald E. Parr, George D. Konidaris, Michael L. Littman
We show that the maximum action-value with respect to a deep RBVF can be approximated easily and accurately.