Search Results for author: Baekjin Kim

Weighted Gaussian Process Bandits for Non-stationary Environments

To this end, we develop WGP-UCB, a novel UCB-type algorithm based on weighted Gaussian process regression.

Paper
Add Code

First, we show that private learnability implies online learnability in both settings.

Paper
Add Code

We investigate two perturbation approaches to overcome conservatism that optimism based algorithms chronically suffer from in practice.

Paper
Code

We investigate the optimality of perturbation based algorithms in the stochastic and adversarial multi-armed bandit problems.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.