Search Results for author: Baekjin Kim

Found 4 papers, 2 papers with code

Weighted Gaussian Process Bandits for Non-stationary Environments

no code implementations6 Jul 2021 Yuntian Deng, Xingyu Zhou, Baekjin Kim, Ambuj Tewari, Abhishek Gupta, Ness Shroff

To this end, we develop WGP-UCB, a novel UCB-type algorithm based on weighted Gaussian process regression.

regression

Randomized Exploration for Non-Stationary Stochastic Linear Bandits

2 code implementations11 Dec 2019 Baekjin Kim, Ambuj Tewari

We investigate two perturbation approaches to overcome conservatism that optimism based algorithms chronically suffer from in practice.

Computational Efficiency Thompson Sampling

On the Optimality of Perturbations in Stochastic and Adversarial Multi-armed Bandit Problems

2 code implementations NeurIPS 2019 Baekjin Kim, Ambuj Tewari

We investigate the optimality of perturbation based algorithms in the stochastic and adversarial multi-armed bandit problems.

Cannot find the paper you are looking for? You can Submit a new open access paper.