Search Results for author: Dmytro Mykhaylov

Found 2 papers, 0 papers with code

Learning from Bandit Feedback: An Overview of the State-of-the-art

no code implementations • 18 Sep 2019 • Olivier Jeunen, Dmytro Mykhaylov, David Rohde, Flavian Vasile, Alexandre Gilotte, Martin Bompaire

To handle this "bandit-feedback" setting, several Counterfactual Risk Minimisation (CRM) methods have been proposed in recent years that attempt to estimate the performance of different policies on historical data.

counterfactual Recommendation Systems

Three Methods for Training on Bandit Feedback

no code implementations • 24 Apr 2019 • Dmytro Mykhaylov, David Rohde, Flavian Vasile, Martin Bompaire, Olivier Jeunen

There are three quite distinct ways to train a machine learning model on recommender system logs.

Recommendation Systems
