1 code implementation • 6 May 2019 • Alexander Zap, Tobias Joppen, Johannes Fürnkranz
Reinforcement learning usually makes use of numerical rewards, which have nice properties but also come with drawbacks and difficulties.
OpenAI Gym • Q-Learning