Search Results for author: Jayakrishnan Nair

Found 10 papers, 3 papers with code

Capacity Provisioning Motivated Online Non-Convex Optimization Problem with Memory and Switching Cost

no code implementations • 26 Mar 2024 • Rahul Vaze, Jayakrishnan Nair

An online non-convex optimization problem is considered where the goal is to minimize the flow time (total delay) of a set of jobs by modulating the number of active servers, but with a switching cost associated with changing the number of active servers over time.

Paper
Add Code

Best Arm Identification in Bandits with Limited Precision Sampling

no code implementations • 10 May 2023 • Kota Srinivas Reddy, P. N. Karthik, Nikhil Karamchandani, Jayakrishnan Nair

The pulled arm and its instantaneous reward are revealed to the learner, whose goal is to find the best arm by minimising the expected stopping time, subject to an upper bound on the error probability.

Paper
Add Code

Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget

no code implementations • 27 Nov 2022 • Fathima Zarin Faizal, Jayakrishnan Nair

A key feature of this algorithm is that it is designed on the basis of an information theoretic lower bound for two-armed instances.

Attribute Multi-Armed Bandits +1

Paper
Add Code

Unsupervised Crowdsourcing with Accuracy and Cost Guarantees

no code implementations • 5 Jul 2022 • Yashvardhan Didwania, Jayakrishnan Nair, N. Hemachandra

We consider the problem of cost-optimal utilization of a crowdsourcing platform for binary, unsupervised classification of a collection of items, given a prescribed error threshold.

Paper
Add Code

Sequential Community Mode Estimation

no code implementations • 16 Nov 2021 • Shubham Anand Jain, Shreyas Goenka, Divyam Bapna, Nikhil Karamchandani, Jayakrishnan Nair

We propose and analyse novel algorithms for this problem, and also establish information theoretic lower bounds on the probability of error under any algorithm.

Paper
Add Code

Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning

no code implementations • 15 Sep 2021 • Vivek Deulkar, Jayakrishnan Nair

We consider the problem of optimal charging/discharging of a bank of heterogenous battery units, driven by stochastic electricity generation and demand processes.

Q-Learning reinforcement-learning +1

Paper
Add Code

Statistically Robust, Risk-Averse Best Arm Identification in Multi-Armed Bandits

no code implementations • 28 Aug 2020 • Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

In this paper, we show that specialized algorithms that exploit such parametric information are prone to inconsistent learning performance when the parameter is misspecified.

Multi-Armed Bandits

Paper
Add Code

Bandit algorithms: Letting go of logarithmic regret for statistical robustness

1 code implementation • 22 Jun 2020 • Kumar Ashutosh, Jayakrishnan Nair, Anmol Kagrecha, Krishna Jagannathan

We study regret minimization in a stochastic multi-armed bandit setting and establish a fundamental trade-off between the regret suffered under an algorithm, and its statistical robustness.

Paper
Code

Constrained regret minimization for multi-criterion multi-armed bandits

1 code implementation • 17 Jun 2020 • Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

We consider a stochastic multi-armed bandit setting and study the problem of constrained regret minimization over a given time horizon.

Attribute Multi-Armed Bandits +1

Paper
Code

Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards

1 code implementation • NeurIPS 2019 • Anmol Kagrecha, Jayakrishnan Nair, Krishna Jagannathan

We also compare the error bounds for our distribution oblivious algorithms with those corresponding to standard non-oblivious algorithms.

Multi-Armed Bandits

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.