1 code implementation • 6 Jan 2016 • Imanol Arrieta Ibarra, Bernardo Ramos, Lars Roemheld
We train a reinforcement learner to play a simplified version of the game Angry Birds.
Efficient Exploration Q-Learning +2