Search Results for author: Yonatan Gur

Found 7 papers, 2 papers with code

Incentivized Exploration via Filtered Posterior Sampling

no code implementations20 Feb 2024 Anand Kalvit, Aleksandrs Slivkins, Yonatan Gur

We study "incentivized exploration" (IE) in social learning problems where the principal (a recommendation algorithm) can leverage information asymmetry to incentivize sequentially-arriving agents to take exploratory actions.

Multi-Armed Bandits

Information Disclosure and Promotion Policy Design for Platforms

no code implementations21 Nov 2019 Yonatan Gur, Gregory Macnamara, Ilan Morgenstern, Daniela Saban

Identifying effective joint information design and promotion policies is a challenging dynamic problem as sellers can sequentially learn the promotion value from sales observations and update prices accordingly.

Smoothness-Adaptive Contextual Bandits

1 code implementation22 Oct 2019 Yonatan Gur, Ahmadreza Momeni, Stefan Wager

In this work, we consider a framework where the smoothness of payoff functions is not known, and study when and how algorithms may adapt to unknown smoothness.

Decision Making Multi-Armed Bandits

Adaptive Sequential Experiments with Unknown Information Arrival Processes

no code implementations28 Jun 2019 Yonatan Gur, Ahmadreza Momeni

When it is known how to map auxiliary data to reward estimates, by obtaining matching lower and upper bounds we characterize a spectrum of minimax complexities for this class of problems as a function of the information arrival process, which captures how salient characteristics of this process impact achievable performance.

Adaptive Learning with Unknown Information Flows

no code implementations NeurIPS 2018 Yonatan Gur, Ahmadreza Momeni

We introduce an adaptive exploration policy that, without any prior knowledge of the information arrival process, attains the best performance (in terms of regret rate) that is achievable when the information arrival process is a priori known.

Decision Making

Stochastic Multi-Armed-Bandit Problem with Non-stationary Rewards

no code implementations NeurIPS 2014 Omar Besbes, Yonatan Gur, Assaf Zeevi

In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution.

Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards

1 code implementation13 May 2014 Omar Besbes, Yonatan Gur, Assaf Zeevi

In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution.

Cannot find the paper you are looking for? You can Submit a new open access paper.