no code implementations • 25 Apr 2018 • Ermo Wei, Drew Wicke, David Freelan, Sean Luke
Policy gradient methods are often applied to reinforcement learning in continuous multiagent games.
Policy Gradient Methods • Q-Learning +2