Search Results for author: Nihal Sharma

Found 4 papers, 1 papers with code

Episodic Bandits with Stochastic Experts

no code implementations7 Jul 2021 Nihal Sharma, Soumya Basu, Karthikeyan Shanmugam, Sanjay Shakkottai

The agent interacts with the environment over episodes, with each episode having different context distributions; this results in the `best expert' changing across episodes.

On Under-exploration in Bandits with Mean Bounds from Confounded Data

no code implementations19 Feb 2020 Nihal Sharma, Soumya Basu, Karthikeyan Shanmugam, Sanjay Shakkottai

We study a variant of the multi-armed bandit problem where side information in the form of bounds on the mean of each arm is provided.

Contextual Bandits with Stochastic Experts

1 code implementation23 Feb 2018 Rajat Sen, Karthikeyan Shanmugam, Nihal Sharma, Sanjay Shakkottai

We consider the problem of contextual bandits with stochastic experts, which is a variation of the traditional stochastic contextual bandit with experts problem.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.