Search Results for author: Odalric-Ambryn Maillard

Found 1 papers, 1 papers with code

Optimal Thompson Sampling strategies for support-aware CVaR bandits

1 code implementation • 10 Dec 2020 • Dorian Baudry, Romain Gautron, Emilie Kaufmann, Odalric-Ambryn Maillard

In this paper we study a multi-arm bandit problem in which the quality of each arm is measured by the Conditional Value at Risk (CVaR) at some level alpha of the reward distribution.

Thompson Sampling

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.