no code implementations • 23 Jun 2019 • Sergey A. Shuvaev, Ngoc B. Tran, Marcus Stephenson-Jones, Bo Li, Alexei A. Koulakov
First, we show that Q-learning neural networks with motivation can navigate in environment with dynamic rewards.
Hierarchical Reinforcement Learning Navigate +3