Search Results for author: Emmanuelle Claeys

Hyper-parameter Tuning for the Contextual Bandit

We study the problem of learning the exploration-exploitation trade-off in the contextual bandit problem with a linear reward function.

