no code implementations • 16 Jun 2017 • Ruitong Huang, Mohammad M. Ajallooeian, Csaba Szepesvári, Martin Müller
We study the problem of identifying the best action among a set of possible options when the value of each action is given by a mapping from a number of noisy micro-observables in the so-called fixed confidence setting.