Search Results for author: Romain Deffayet

Found 6 papers, 3 papers with code

Unbiased Learning to Rank Meets Reality: Lessons from Baidu's Large-Scale Search Dataset

2 code implementations3 Apr 2024 Philipp Hager, Romain Deffayet, Jean-Michel Renders, Onno Zoeter, Maarten de Rijke

Our experiments reveal that gains in click prediction do not necessarily translate to enhanced ranking performance on expert relevance annotations, implying that conclusions strongly depend on how success is measured in this benchmark.

Learning-To-Rank

SARDINE: A Simulator for Automated Recommendation in Dynamic and Interactive Environments

1 code implementation28 Nov 2023 Romain Deffayet, Thibaut Thonet, Dongyoon Hwang, Vassilissa Lehoux, Jean-Michel Renders, Maarten de Rijke

Simulators can provide valuable insights for researchers and practitioners who wish to improve recommender systems, because they allow one to easily tweak the experimental setup in which recommender systems operate, and as a result lower the cost of identifying general trends and uncovering novel findings about the candidate methods.

counterfactual Learning-To-Rank +1

Distributional Reinforcement Learning with Dual Expectile-Quantile Regression

no code implementations26 May 2023 Sami Jullien, Romain Deffayet, Jean-Michel Renders, Paul Groth, Maarten de Rijke

Motivated by the efficiency of $L_2$-based learning, we propose to jointly learn expectiles and quantiles of the return distribution in a way that allows efficient learning while keeping an estimate of the full distribution of returns.

Continuous Control Distributional Reinforcement Learning +3

An Offline Metric for the Debiasedness of Click Models

2 code implementations19 Apr 2023 Romain Deffayet, Philipp Hager, Jean-Michel Renders, Maarten de Rijke

We prove that debiasedness is a necessary condition for recovering unbiased and consistent relevance scores and for the invariance of click prediction under covariate shift.

counterfactual Learning-To-Rank +1

Generative Slate Recommendation with Reinforcement Learning

no code implementations20 Jan 2023 Romain Deffayet, Thibaut Thonet, Jean-Michel Renders, Maarten de Rijke

Our findings suggest that representation learning using generative models is a promising direction towards generalizable RL-based slate recommendation.

Recommendation Systems reinforcement-learning +2

Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives

no code implementations3 Jan 2023 Romain Deffayet, Thibaut Thonet, Jean-Michel Renders, Maarten de Rijke

In this paper, we argue that the paradigm commonly adopted for offline evaluation of sequential recommender systems is unsuitable for evaluating reinforcement learning-based recommenders.

Offline RL Recommendation Systems +2

Cannot find the paper you are looking for? You can Submit a new open access paper.