Search Results for author: Boris Lesner

Found 2 papers, 0 papers with code

Tight Performance Bounds for Approximate Modified Policy Iteration with Non-Stationary Policies

no code implementations • 20 Apr 2013 • Boris Lesner, Bruno Scherrer

For approximate modified policy iteration with non-stationary policies, we provide an error propagation analysis in the form of a performance bound on the resulting policies; this bound can improve the usual performance bound by a factor $O(1-\gamma)$, which is significant when the discount factor $\gamma$ is close to 1.

On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Markov Decision Processes

no code implementations • NeurIPS 2012 • Bruno Scherrer, Boris Lesner

We consider infinite-horizon stationary $\gamma$-discounted Markov Decision Processes, for which it is known that there exists a stationary optimal policy.
