Combinational Q-Learning for Dou Di Zhu

24 Jan 2019 Yang You Liangwei Li Baisong Guo Weiming Wang Cewu Lu

Deep reinforcement learning (DRL) has gained a lot of attention in recent years, and has been proven to be able to play Atari games and Go at or above human levels. However, those games are assumed to have a small fixed number of actions and could be trained with a simple CNN network... (read more)

PDF Abstract

Datasets


  Add Datasets introduced or used in this paper

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper


METHOD TYPE
Entropy Regularization
Regularization
Convolution
Convolutions
Dense Connections
Feedforward Networks
Softmax
Output Functions
A3C
Policy Gradient Methods
Q-Learning
Off-Policy TD Control