no code implementations • 28 Nov 2022 • MinGyu Park, Jaeuk Shin, Insoon Yang
Inspired by the quasi-Newton interpretation of Anderson acceleration (AA), we propose a maximum entropy variant of QMDP, which we call soft QMDP, to fully benefit from AA.
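As a rough illustration of what a maximum entropy variant of QMDP could look like, the sketch below replaces the hard max in the QMDP Bellman backup with a temperature-scaled log-sum-exp. This is an assumption about the general idea, not the authors' algorithm: the names `soft_value`, `soft_qmdp`, `belief_q`, the temperature `tau`, and the array layout `P[a][s][s2]`, `R[s][a]` are all hypothetical, and the loop is a plain fixed-point iteration with no acceleration step.

```python
import math


def soft_value(q_row, tau):
    """Temperature-scaled log-sum-exp over actions (a smooth stand-in for max)."""
    m = max(q_row)  # subtract the max for numerical stability
    return m + tau * math.log(sum(math.exp((q - m) / tau) for q in q_row))


def soft_qmdp(P, R, gamma, tau, iters=500):
    """Soft Q-iteration on the underlying fully observable MDP.

    P[a][s][s2] holds transition probabilities, R[s][a] holds rewards.
    Both layouts are assumptions made for this sketch.
    """
    n_s, n_a = len(R), len(R[0])
    Q = [[0.0] * n_a for _ in range(n_s)]
    for _ in range(iters):
        V = [soft_value(Q[s], tau) for s in range(n_s)]
        Q = [
            [
                R[s][a] + gamma * sum(P[a][s][s2] * V[s2] for s2 in range(n_s))
                for a in range(n_a)
            ]
            for s in range(n_s)
        ]
    return Q


def belief_q(Q, belief):
    """QMDP-style action values for a belief: the belief-weighted average of Q[s][a]."""
    n_a = len(Q[0])
    return [sum(b * Q[s][a] for s, b in enumerate(belief)) for a in range(n_a)]
```

Because the log-sum-exp backup is smooth in Q, unlike the max, it is the kind of fixed-point map a quasi-Newton-style scheme can be applied to; as `tau` shrinks, the backup approaches the standard QMDP max.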
no code implementations • 29 Mar 2021 • Melike Ermis, MinGyu Park, Insoon Yang
This paper proposes an accelerated method for approximately solving partially observable Markov decision process (POMDP) problems offline.