1 code implementation • NeurIPS 2023 • Raunak Kumar, Sarah Dean, Robert Kleinberg
As a special case, we prove the first non-trivial lower bound for OCO with finite memory \citep{anavaHM2015online}, which could be of independent interest, and also improve existing upper bounds.
1 code implementation • 24 Sep 2022 • Raunak Kumar, Robert Kleinberg
Bandits with knapsacks (BwK) is an influential model of sequential decision-making under uncertainty that incorporates resource consumption constraints.
no code implementations • 2 Nov 2020 • Frederik Kunstner, Raunak Kumar, Mark Schmidt
In this work we first show that for the common setting of exponential family distributions, viewing EM as a mirror descent algorithm leads to convergence rates in Kullback-Leibler (KL) divergence.