Search Results for author: Chandramouli Kamanchi

Found 7 papers, 3 papers with code

A Convergent Off-Policy Temporal Difference Algorithm

1 code implementation • 13 Nov 2019 • Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar

In this work, we propose a convergent on-line off-policy TD algorithm under linear function approximation.

Reinforcement Learning (RL)

Generalized Speedy Q-learning

1 code implementation • 1 Nov 2019 • Indu John, Chandramouli Kamanchi, Shalabh Bhatnagar

In most RL algorithms such as Q-learning, the Bellman equation and the Bellman operator play an important role.

Q-Learning, Reinforcement Learning (RL)

Generalized Second Order Value Iteration in Markov Decision Processes

2 code implementations • 10 May 2019 • Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Shalabh Bhatnagar

In this work, we propose a second order value iteration procedure that is obtained by applying the Newton-Raphson method to the successive relaxation value iteration scheme.

Successive Over Relaxation Q-Learning

no code implementations • 9 Mar 2019 • Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Shalabh Bhatnagar

We first derive a modified fixed point iteration for SOR Q-values and utilize stochastic approximation to derive a learning algorithm to compute the optimal value function and an optimal policy.

Q-Learning, Reinforcement Learning (RL)

An Online Sample Based Method for Mode Estimation using ODE Analysis of Stochastic Approximation Algorithms

no code implementations • 11 Feb 2019 • Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Prabuchandran K. J., Shalabh Bhatnagar

In many of the practical applications, the analytical form of the density is not known and only the samples from the distribution are available.
