Search Results for author: Madalina M. Drugan

Found 1 papers, 1 papers with code

Sampled Policy Gradient for Learning to Play the Game Agar.io

2 code implementations15 Sep 2018 Anton Orell Wiehe, Nil Stolt Ansó, Madalina M. Drugan, Marco A. Wiering

In this paper, a new offline actor-critic learning algorithm is introduced: Sampled Policy Gradient (SPG).

Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.