Search Results for author: R\' emi Munos

Cheap Bandits

We consider stochastic sequential learning problems where the learner can observe the \textit{average reward of several actions}.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.