no code implementations • 30 May 2023 • Gen Li, Weichen Wu, Yuejie Chi, Cong Ma, Alessandro Rinaldo, Yuting Wei
This paper is concerned with the problem of policy evaluation with linear function approximation in discounted infinite horizon Markov decision processes.