1 code implementation • 22 Dec 2020 • Rinu Boney, Alexander Ilin, Juho Kannala, Jarno Seppänen
We experimentally show that planning with naive Monte Carlo tree search does not perform very well in large combinatorial action spaces.
Thompson Sampling