Search Results for author: Nandan Sudarsanam

Found 5 papers, 0 papers with code

An Active Learning Framework for Efficient Robust Policy Search

no code implementations • 1 Jan 2019 • Sai Kiran Narayanaswami, Nandan Sudarsanam, Balaraman Ravindran

Robust Policy Search is the problem of learning policies that do not degrade in performance when subject to unseen environment model parameters.

Active Learning Continuous Control +1

Paper
Add Code

Rate of Change Analysis for Interestingness Measures

no code implementations • 14 Dec 2017 • Nandan Sudarsanam, Nishanth Kumar, Abhishek Sharma, Balaraman Ravindran

We present a comprehensive analysis of 50 interestingness measures and classify them in accordance with the two properties.

General Classification

Paper
Add Code

Efficient-UCBV: An Almost Optimal Algorithm using Variance Estimates

no code implementations • 9 Nov 2017 • Subhojyoti Mukherjee, K. P. Naveen, Nandan Sudarsanam, Balaraman Ravindran

We propose a novel variant of the UCB algorithm (referred to as Efficient-UCB-Variance (EUCBV)) for minimizing cumulative regret in the stochastic multi-armed bandit (MAB) setting.

Thompson Sampling

Paper
Add Code

Thresholding Bandits with Augmented UCB

no code implementations • 7 Apr 2017 • Subhojyoti Mukherjee, K. P. Naveen, Nandan Sudarsanam, Balaraman Ravindran

In this paper we propose the Augmented-UCB (AugUCB) algorithm for a fixed-budget version of the thresholding bandit problem (TBP), where the objective is to identify a set of arms whose quality is above a threshold.

Paper
Add Code

Linear Bandit algorithms using the Bootstrap

no code implementations • 4 May 2016 • Nandan Sudarsanam, Balaraman Ravindran

One of the proposed methods, X-Random bootstrap, performs better than the baselines in-terms of cumulative regret across various degrees of noise and different number of trials.

Thompson Sampling

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.