Search Results for author: Jayakrishnan Nair

Found 10 papers, 3 papers with code

Capacity Provisioning Motivated Online Non-Convex Optimization Problem with Memory and Switching Cost

no code implementations26 Mar 2024 Rahul Vaze, Jayakrishnan Nair

An online non-convex optimization problem is considered where the goal is to minimize the flow time (total delay) of a set of jobs by modulating the number of active servers, but with a switching cost associated with changing the number of active servers over time.

Best Arm Identification in Bandits with Limited Precision Sampling

no code implementations10 May 2023 Kota Srinivas Reddy, P. N. Karthik, Nikhil Karamchandani, Jayakrishnan Nair

The pulled arm and its instantaneous reward are revealed to the learner, whose goal is to find the best arm by minimising the expected stopping time, subject to an upper bound on the error probability.

Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget

no code implementations27 Nov 2022 Fathima Zarin Faizal, Jayakrishnan Nair

A key feature of this algorithm is that it is designed on the basis of an information theoretic lower bound for two-armed instances.

Attribute Multi-Armed Bandits +1

Unsupervised Crowdsourcing with Accuracy and Cost Guarantees

no code implementations5 Jul 2022 Yashvardhan Didwania, Jayakrishnan Nair, N. Hemachandra

We consider the problem of cost-optimal utilization of a crowdsourcing platform for binary, unsupervised classification of a collection of items, given a prescribed error threshold.

Sequential Community Mode Estimation

no code implementations16 Nov 2021 Shubham Anand Jain, Shreyas Goenka, Divyam Bapna, Nikhil Karamchandani, Jayakrishnan Nair

We propose and analyse novel algorithms for this problem, and also establish information theoretic lower bounds on the probability of error under any algorithm.

Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning

no code implementations15 Sep 2021 Vivek Deulkar, Jayakrishnan Nair

We consider the problem of optimal charging/discharging of a bank of heterogenous battery units, driven by stochastic electricity generation and demand processes.

Q-Learning reinforcement-learning +1

Statistically Robust, Risk-Averse Best Arm Identification in Multi-Armed Bandits

no code implementations28 Aug 2020 Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

In this paper, we show that specialized algorithms that exploit such parametric information are prone to inconsistent learning performance when the parameter is misspecified.

Multi-Armed Bandits

Bandit algorithms: Letting go of logarithmic regret for statistical robustness

1 code implementation22 Jun 2020 Kumar Ashutosh, Jayakrishnan Nair, Anmol Kagrecha, Krishna Jagannathan

We study regret minimization in a stochastic multi-armed bandit setting and establish a fundamental trade-off between the regret suffered under an algorithm, and its statistical robustness.

Constrained regret minimization for multi-criterion multi-armed bandits

1 code implementation17 Jun 2020 Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

We consider a stochastic multi-armed bandit setting and study the problem of constrained regret minimization over a given time horizon.

Attribute Multi-Armed Bandits +1

Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards

1 code implementation NeurIPS 2019 Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

We also compare the error bounds for our distribution oblivious algorithms with those corresponding to standard non-oblivious algorithms.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.