Search Results for author: Xunyu Zhou

Found 1 papers, 0 papers with code

Exploration versus exploitation in reinforcement learning: a stochastic control approach

no code implementations4 Dec 2018 Haoran Wang, Thaleia Zariphopoulou, Xunyu Zhou

We carry out a complete analysis of the problem in the linear--quadratic (LQ) setting and deduce that the optimal feedback control distribution for balancing exploitation and exploration is Gaussian.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.