Search Results for author: Chandramouli Kamanchi

Found 7 papers, 3 papers with code

An Application of Newsboy Problem in Supply Chain Optimisation of Online Fashion E-Commerce

no code implementations • 6 Jul 2020 • Chandramouli Kamanchi, Gopinath Ashok Kumar, Nachiappan Sundaram, Ravindra Babu T, Chaithanya Bandi

We describe a supply chain optimization model deployed at Myntra, an online fashion e-commerce company in India.

A Convergent Off-Policy Temporal Difference Algorithm

1 code implementation • 13 Nov 2019 • Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar

In this work, we propose a convergent on-line off-policy TD algorithm under linear function approximation.

Reinforcement Learning (RL)

Generalized Speedy Q-learning

1 code implementation • 1 Nov 2019 • Indu John, Chandramouli Kamanchi, Shalabh Bhatnagar

In most RL algorithms such as Q-learning, the Bellman equation and the Bellman operator play an important role.

Q-Learning, Reinforcement Learning (RL)

A Generalized Minimax Q-learning Algorithm for Two-Player Zero-Sum Stochastic Games

no code implementations • 16 Jun 2019 • Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar

The problem of two-player zero-sum stochastic games is formulated as a min-max Markov game in the literature.

Generalized Second Order Value Iteration in Markov Decision Processes

2 code implementations • 10 May 2019 • Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Shalabh Bhatnagar

In this work, we propose a second order value iteration procedure that is obtained by applying the Newton-Raphson method to the successive relaxation value iteration scheme.

Successive Over Relaxation Q-Learning

no code implementations • 9 Mar 2019 • Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Shalabh Bhatnagar

We first derive a modified fixed point iteration for SOR Q-values and utilize stochastic approximation to derive a learning algorithm to compute the optimal value function and an optimal policy.

Q-Learning, Reinforcement Learning (RL)

An Online Sample Based Method for Mode Estimation using ODE Analysis of Stochastic Approximation Algorithms

no code implementations • 11 Feb 2019 • Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Prabuchandran K. J., Shalabh Bhatnagar

In many practical applications, the analytical form of the density is unknown and only samples from the distribution are available.
