no code implementations • NeurIPS 2015 • Huasen Wu, R. Srikant, Xin Liu, Chong Jiang
To the best of our knowledge, this is the first work that shows how to achieve logarithmic regret in constrained contextual bandits.
Multi-Armed Bandits