no code implementations • 25 Dec 2017 • David Liau, Eric Price, Zhao Song, Ger Yang
We consider the stochastic bandit problem in the sublinear space setting, where one cannot record the win-loss record for all $K$ arms.
Multi-Armed Bandits