6 code implementations • 3 Mar 2016 • Johannes Heinrich, David Silver
When applied to Leduc poker, Neural Fictitious Self-Play (NFSP) approached a Nash equilibrium, whereas common reinforcement learning methods diverged.
Game of Poker reinforcement-learning +1