no code implementations • 18 Jul 2023 • Hossein Abouee-Mehrizi, Mahdi Mirjalili, Vahid Sarhangian
We approximate the value function using a linear combination of basis functions and tune the parameters using a simulation-based policy iteration algorithm.