Soft Q Network

20 Dec 2019 · Jingbin Liu, Shuai Liu, Xinyang Gu

Deep Q Network (DQN) is a very successful algorithm, yet the inherent problem of reinforcement learning, i.e. the exploit-explore balance, remains. In this work, we introduce entropy regularization into DQN and propose SQN...
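
The abstract is cut off before it describes the SQN update, so the following is only a minimal sketch of what entropy regularization usually means for a Q-learning target. It is the standard maximum-entropy ("soft") backup, not necessarily the authors' formulation. The function names, the temperature `alpha`, and the discount `gamma` are assumptions introduced here for illustration.

```python
import math

def soft_value(q_values, alpha):
    """Entropy-regularized state value: alpha * log(sum_a exp(Q(s, a) / alpha))."""
    m = max(q_values)  # subtract the max before exponentiating, for numerical stability
    return m + alpha * math.log(sum(math.exp((q - m) / alpha) for q in q_values))

def soft_policy(q_values, alpha):
    """Boltzmann policy: pi(a|s) proportional to exp(Q(s, a) / alpha)."""
    v = soft_value(q_values, alpha)
    return [math.exp((q - v) / alpha) for q in q_values]

def soft_td_target(reward, next_q_values, done, gamma=0.99, alpha=0.2):
    """One-step target r + gamma * V_soft(s'); no bootstrapping at terminal states."""
    if done:
        return reward
    return reward + gamma * soft_value(next_q_values, alpha)

if __name__ == "__main__":
    q_next = [1.0, 2.0, 0.5]  # hypothetical Q-values for the next state
    print(soft_policy(q_next, alpha=0.5))
    print(soft_td_target(reward=1.0, next_q_values=q_next, done=False))
```

With a small `alpha` the soft value approaches the hard max used by standard DQN and the policy becomes nearly greedy. A larger `alpha` spreads probability across actions, which is how this kind of regularization is expected to encourage exploration.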

Methods used in the Paper


METHOD                  TYPE
Q-Learning              Off-Policy TD Control
Entropy Regularization  Regularization
Dense Connections       Feedforward Networks
Convolution             Convolutions
DQN                     Q-Learning Networks