no code implementations • 4 Dec 2018 • Mohammad Naghshvar, Ahmed K. Sadek, Auke J. Wiggers
It is shown that upper confidence bound (UCB) for expanding the tree results in noisy Q-value estimates by the MCTS and a degraded performance of QMDP.
no code implementations • 22 Jun 2016 • Auke J. Wiggers, Frans A. Oliehoek, Diederik M. Roijers
Zero-sum stochastic games provide a rich model for competitive decision making.