3 Jun 2022 • Chaitanya Agarwal, Shibashis Guha, Jan Křetínský, M. Pazhamalai
We provide the first algorithm to compute the mean payoff probably approximately correctly (PAC) in unknown Markov decision processes (MDPs); we further extend it to unknown continuous-time MDPs (CTMDPs).