Search Results for author: J. G. Dai

Found 3 papers, 0 papers with code

Refined Policy Improvement Bounds for MDPs

no code implementations • 16 Jul 2021 • J. G. Dai, Mark Gluzman

The existing bound degenerates as the discount factor approaches one, which makes the applicability of TRPO and related algorithms questionable in that regime.

Queueing Network Controls via Deep Reinforcement Learning

no code implementations • 31 Jul 2020 • J. G. Dai, Mark Gluzman

A key to the successes of our PPO algorithm is the use of three variance reduction techniques in estimating the relative value function via sampling.

Reinforcement Learning (RL)
