Search Results for author: Rong Jiang

Found 1 papers, 0 papers with code

Batched Nonparametric Contextual Bandits

no code implementations • 27 Feb 2024 • Rong Jiang, Cong Ma

We study nonparametric contextual bandits under batch constraints, where the expected reward for each action is modeled as a smooth function of covariates, and the policy updates are made at the end of each batch of observations.

Multi-Armed Bandits

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.