no code implementations • 17 Nov 2016 • Stefano Paladino, Francesco Trovò, Marcello Restelli, Nicola Gatti
We study, to the best of our knowledge, the first Bayesian algorithm for unimodal Multi-Armed Bandit (MAB) problems with graph structure.
Thompson Sampling