Search Results for author: Raphaël Féraud

Found 6 papers, 1 papers with code

Restarted Bayesian Online Change-point Detector achieves Optimal Detection Delay

1 code implementation • ICML 2020 • REDA ALAMI, Odalric-Ambrym Maillard, Raphaël Féraud

In this paper, we consider the problem of sequential change-point detection where both the change-points and the distributions before and after the change are assumed to be unknown.

Change Point Detection Learning Theory

Paper
Code

Double-Linear Thompson Sampling for Context-Attentive Bandits

no code implementations • 15 Oct 2020 • Djallel Bouneffouf, Raphaël Féraud, Sohini Upadhyay, Yasaman Khazaeni, Irina Rish

In this paper, we analyze and extend an online learning framework known as Context-Attentive Bandit, motivated by various practical applications, from medical diagnosis to dialog systems, where due to observation costs only a small subset of a potentially large number of context variables can be observed at each iteration;however, the agent has a freedom to choose which variables to observe.

Medical Diagnosis Thompson Sampling

Paper
Add Code

Decentralized Exploration in Multi-Armed Bandits -- Extended version

no code implementations • 19 Nov 2018 • Raphaël Féraud, Réda Alami, Romain Laroche

We consider the decentralized exploration problem: a set of players collaborate to identify the best arm by asynchronously interacting with the same stochastic environment.

Multi-Armed Bandits

Paper
Add Code

Random Shuffling and Resets for the Non-stationary Stochastic Bandit Problem

no code implementations • 7 Sep 2016 • Robin Allesiardo, Raphaël Féraud, Odalric-Ambrym Maillard

For the best-arm identification task, we introduce a version of Successive Elimination based on random shuffling of the $K$ arms.

Paper
Add Code

Network of Bandits insure Privacy of end-users

no code implementations • 11 Feb 2016 • Raphaël Féraud

We provide a first algorithm, Distributed Median Elimination, which is optimal in term of number of transmitted bits and near optimal in term of speed-up factor with respect to an optimal algorithm run independently on each player.

Paper
Add Code

Random Forest for the Contextual Bandit Problem - extended version

no code implementations • 27 Apr 2015 • Raphaël Féraud, Robin Allesiardo, Tanguy Urvoy, Fabrice Clérot

The dependence of the sample complexity upon the number of contextual variables is logarithmic.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.