no code implementations • 1 Jan 2020 • Kai Jiang, XiaoLong Qin
But the rewards in the actual environment are sparse, and even some environments will not rewards.
reinforcement-learning Reinforcement Learning (RL)