Deep Q-Learning has been successfully applied to a wide variety of tasks in the past several years. However, the architecture of the vanilla Deep Q-Network is not suited to deal with partially observable environments such as 3D video games... (read more)
PDFMETHOD | TYPE | |
---|---|---|
![]() |
Off-Policy TD Control |