no code implementations • 6 Jul 2021 • Yuntian Deng, Xingyu Zhou, Baekjin Kim, Ambuj Tewari, Abhishek Gupta, Ness Shroff
To this end, we develop WGP-UCB, a novel UCB-type algorithm based on weighted Gaussian process regression.
no code implementations • NeurIPS 2020 • Young Hun Jung, Baekjin Kim, Ambuj Tewari
First, we show that private learnability implies online learnability in both settings.
2 code implementations • 11 Dec 2019 • Baekjin Kim, Ambuj Tewari
We investigate two perturbation approaches to overcome conservatism that optimism based algorithms chronically suffer from in practice.
2 code implementations • NeurIPS 2019 • Baekjin Kim, Ambuj Tewari
We investigate the optimality of perturbation based algorithms in the stochastic and adversarial multi-armed bandit problems.