no code implementations • 17 Oct 2013 • Tianbing Xu, Yaming Yu, John Turner, Amelia Regan
For the context bandit problems, Thompson Sampling is adopted based on the underlying posterior distributions of the parameters.
Thompson Sampling