no code implementations • 13 Apr 2020 • Eric B. Jones, Peter Graf, Eliot Kapit, Wesley Jones
The Markov decision process is the mathematical formalization underlying the modern field of reinforcement learning when transition and reward functions are unknown.