no code implementations • 1 Dec 2021 • Yiwen Zhu, Zhou Fang, Yuan Zheng, Wenya Wei
In this paper, we propose a homotopy-based soft actor-critic method (HSAC) which focuses on addressing these problems via following the homotopy path between the original task with sparse reward and the auxiliary task with artificial prior experience reward.