Search Results for author: Simon Li

Found 1 paper, 0 papers with code

Inferring the Optimal Policy using Markov Chain Monte Carlo

no code implementations • 16 Nov 2019 • Brandon Trabucco, Albert Qu, Simon Li, Ganeshkumar Ashokavardhanan

Existing methods for estimating the optimal stochastic control policy rely on high-variance estimates of the policy descent.
