1 code implementation • 5 May 2020 • Budi Kurniawan, Peter Vamplew, Michael Papasimeon, Richard Dazeley, Cameron Foale
It then selects from each discrete state an input value and the action with the highest numerical preference as an input/target pair.