Search Results for author: Rong Jiang

Found 1 paper, 0 papers with code

Batched Nonparametric Contextual Bandits

no code implementations • 27 Feb 2024 • Rong Jiang, Cong Ma

We study nonparametric contextual bandits under batch constraints, where the expected reward for each action is modeled as a smooth function of covariates, and the policy updates are made at the end of each batch of observations.

Multi-Armed Bandits
