1 code implementation • ICML 2020 • REDA ALAMI, Odalric-Ambrym Maillard, Raphaël Féraud
In this paper, we consider the problem of sequential change-point detection where both the change-points and the distributions before and after the change are assumed to be unknown.
no code implementations • 15 Oct 2020 • Djallel Bouneffouf, Raphaël Féraud, Sohini Upadhyay, Yasaman Khazaeni, Irina Rish
In this paper, we analyze and extend an online learning framework known as Context-Attentive Bandit, motivated by various practical applications, from medical diagnosis to dialog systems, where due to observation costs only a small subset of a potentially large number of context variables can be observed at each iteration;however, the agent has a freedom to choose which variables to observe.
no code implementations • 19 Nov 2018 • Raphaël Féraud, Réda Alami, Romain Laroche
We consider the decentralized exploration problem: a set of players collaborate to identify the best arm by asynchronously interacting with the same stochastic environment.
no code implementations • 7 Sep 2016 • Robin Allesiardo, Raphaël Féraud, Odalric-Ambrym Maillard
For the best-arm identification task, we introduce a version of Successive Elimination based on random shuffling of the $K$ arms.
no code implementations • 11 Feb 2016 • Raphaël Féraud
We provide a first algorithm, Distributed Median Elimination, which is optimal in term of number of transmitted bits and near optimal in term of speed-up factor with respect to an optimal algorithm run independently on each player.
no code implementations • 27 Apr 2015 • Raphaël Féraud, Robin Allesiardo, Tanguy Urvoy, Fabrice Clérot
The dependence of the sample complexity upon the number of contextual variables is logarithmic.