Search Results for author: Rohan Deb

Found 6 papers, 0 papers with code

Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources

no code implementations • 28 Dec 2023 • Rohan Deb, Aadirupa Saha

We show that, due to the relative nature of the feedback, the problem is more difficult than its bandit counterpart and that, without further assumptions, it is not learnable from a regret-minimization perspective.
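The relative nature of the feedback can be made concrete with a toy dueling loop: the learner only ever observes which of two pulled arms wins a comparison, never an absolute reward. The sketch below is purely illustrative, with an assumed Bernoulli preference model and made-up arm utilities; it is not the paper's setting or algorithm.

```python
import numpy as np

rng = np.random.default_rng(0)
means = np.array([0.2, 0.5, 0.8])  # assumed latent arm utilities (not from the paper)

def duel(i, j):
    """Return 1 if arm i beats arm j under an assumed Bernoulli preference model.

    The learner sees only this relative outcome, never means[i] itself,
    which is what makes the problem harder than a standard bandit.
    """
    p_i_wins = means[i] / (means[i] + means[j])
    return rng.random() < p_i_wins

wins = np.zeros((3, 3))
plays = np.zeros((3, 3))
for _ in range(1000):
    i, j = rng.choice(3, size=2, replace=False)
    outcome = duel(i, j)
    wins[i, j] += outcome
    wins[j, i] += 1 - outcome
    plays[i, j] += 1
    plays[j, i] += 1

# Empirical pairwise win rates: the only information available to the learner.
print(np.divide(wins, plays, out=np.zeros_like(wins), where=plays > 0))
```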

Contextual Bandits with Online Neural Regression

no code implementations • 12 Dec 2023 • Rohan Deb, Yikun Ban, Shiliang Zuo, Jingrui He, Arindam Banerjee

Based on such a perturbed prediction, we show an $\mathcal{O}(\log T)$ regret for online regression with both squared loss and KL loss, and subsequently convert these respectively to $\tilde{\mathcal{O}}(\sqrt{KT})$ and $\tilde{\mathcal{O}}(\sqrt{KL^*} + K)$ regret for NeuCB (neural contextual bandits), where $L^*$ is the loss of the best policy.

Multi-Armed Bandits • regression
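To make the perturbed-prediction idea concrete, here is a rough sketch of the exploration mechanism in a contextual bandit loop: noise is added to the predicted rewards before the argmax, so near-ties get explored rather than exploited greedily. Everything here is an assumption for illustration (a per-arm linear regressor standing in for the network, Gaussian perturbations, arbitrary hyperparameters); it is not the authors' NeuCB algorithm or their perturbation scheme.

```python
import numpy as np

rng = np.random.default_rng(0)
d, K, T = 5, 4, 2000
theta_true = rng.normal(size=(K, d))  # assumed true reward model, one vector per arm
w = np.zeros((K, d))                  # online regressor (linear stand-in for a network)
lr, sigma = 0.05, 0.1                 # assumed step size and perturbation scale

regret = 0.0
for t in range(T):
    x = rng.normal(size=d)
    preds = w @ x
    # Perturbed prediction: noise before the argmax forces exploration.
    a = int(np.argmax(preds + sigma * rng.normal(size=K)))
    r = theta_true[a] @ x + 0.1 * rng.normal()
    # One step of online least squares on the observed (context, reward) pair.
    w[a] += lr * (r - w[a] @ x) * x
    regret += (theta_true @ x).max() - theta_true[a] @ x

print(f"cumulative regret after {T} rounds: {regret:.1f}")
```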

$N$-Timescale Stochastic Approximation: Stability and Convergence

no code implementations • 7 Dec 2021 • Rohan Deb, Shalabh Bhatnagar

This paper presents the first sufficient conditions that guarantee the stability and almost sure convergence of $N$-timescale stochastic approximation (SA) iterates for any $N\geq1$.
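For intuition, here is a minimal two-timescale ($N = 2$) sketch: a fast iterate tracks its equilibrium for the current value of a slow iterate, while the slow iterate updates as if the fast one had already converged. The fixed point, noise model, and step-size exponents are illustrative assumptions, chosen only so that the slow steps decay faster than the fast ones; they are not the paper's stability conditions.

```python
import numpy as np

# Two-timescale SA toy (N = 2). The fast iterate x chases the current y,
# while the slow iterate y treats x as if it had already equilibrated.
x, y = 0.0, 0.0
target = 2.0  # assumed fixed point for the slow recursion
rng = np.random.default_rng(0)

for n in range(1, 200_000):
    b_n = 1.0 / n ** 0.6  # fast timescale step size
    a_n = 1.0 / n         # slow timescale step size, a_n = o(b_n)
    noise = rng.normal(scale=0.1, size=2)
    x += b_n * (y - x + noise[0])        # fast: x tracks y
    y += a_n * (target - x + noise[1])   # slow: y updates assuming x has converged

print(f"x = {x:.3f}, y = {y:.3f}  (both should approach {target})")
```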

Schedule Based Temporal Difference Algorithms

no code implementations • 23 Nov 2021 • Rohan Deb, Meet Gandhi, Shalabh Bhatnagar

However, the weights assigned to different $n$-step returns in TD($\lambda$), controlled by the parameter $\lambda$, decrease exponentially with increasing $n$.
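The exponential weighting referred to here is easy to reproduce: TD($\lambda$) assigns the $n$-step return a weight of $(1-\lambda)\lambda^{n-1}$, which decays geometrically in $n$. The snippet below prints these standard weights for an illustrative $\lambda$; the schedule-based algorithms of the paper replace this fixed geometric schedule, which the sketch does not attempt to implement.

```python
import numpy as np

lam = 0.9  # illustrative value of the TD(lambda) trace parameter
n = np.arange(1, 11)
weights = (1 - lam) * lam ** (n - 1)  # weight on the n-step return in TD(lambda)

for ni, wi in zip(n, weights):
    print(f"n = {ni:2d}: weight = {wi:.4f}")
# The weights decay exponentially in n and sum to 1 as n -> infinity.
print("partial sum over n <= 10:", weights.sum())
```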

Gradient Temporal Difference with Momentum: Stability and Convergence

no code implementations • 22 Nov 2021 • Rohan Deb, Shalabh Bhatnagar

Here, we consider Gradient TD algorithms with an additional heavy-ball momentum term and provide a choice of step size and momentum parameter that ensures almost sure asymptotic convergence of these algorithms.
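The heavy-ball momentum term mentioned in the abstract augments a gradient step with a multiple of the previous displacement. Below is a generic heavy-ball iteration on a toy quadratic, purely to show the update's shape; it is not Gradient TD, and the step size and momentum parameter are arbitrary choices rather than the schedule the paper derives.

```python
import numpy as np

# Heavy-ball update on a simple quadratic f(theta) = 0.5 * theta' A theta.
# Illustrates the momentum term only; alpha and beta are assumed values,
# not the step-size/momentum choice from the paper.
A = np.diag([1.0, 10.0])
theta = np.array([5.0, 5.0])
prev_theta = theta.copy()
alpha, beta = 0.05, 0.9

for _ in range(200):
    grad = A @ theta
    # theta_{t+1} = theta_t - alpha * grad + beta * (theta_t - theta_{t-1})
    theta, prev_theta = theta - alpha * grad + beta * (theta - prev_theta), theta

print("theta after 200 heavy-ball steps:", theta)  # should be near the minimizer 0
```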

Does Momentum Help? A Sample Complexity Analysis

no code implementations • 29 Oct 2021 • Swetha Ganesh, Rohan Deb, Gugan Thoppe, Amarjit Budhiraja

Stochastic Heavy Ball (SHB) and Nesterov's Accelerated Stochastic Gradient (ASG) are popular momentum methods in stochastic optimization.

Stochastic Optimization
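For readers unfamiliar with the two methods being compared, the standard updates are sketched side by side below on a noisy quadratic: SHB adds momentum from the previous displacement, while Nesterov's ASG evaluates the gradient at a look-ahead point. The objective, noise scale, and hyperparameters are illustrative assumptions, not the setting of the paper's sample-complexity analysis.

```python
import numpy as np

rng = np.random.default_rng(0)
A = np.diag([1.0, 10.0])
alpha, beta, steps = 0.02, 0.9, 500  # assumed hyperparameters

def noisy_grad(theta):
    """Stochastic gradient of 0.5 * theta' A theta with additive Gaussian noise."""
    return A @ theta + rng.normal(scale=0.1, size=2)

# Stochastic Heavy Ball (SHB): momentum from the previous displacement.
theta, prev = np.array([5.0, 5.0]), np.array([5.0, 5.0])
for _ in range(steps):
    theta, prev = theta - alpha * noisy_grad(theta) + beta * (theta - prev), theta

# Nesterov's Accelerated Stochastic Gradient (ASG): gradient at a look-ahead point.
x, prev_x = np.array([5.0, 5.0]), np.array([5.0, 5.0])
for _ in range(steps):
    lookahead = x + beta * (x - prev_x)
    x, prev_x = lookahead - alpha * noisy_grad(lookahead), x

print("SHB final iterate:", theta)
print("ASG final iterate:", x)
```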
