no code implementations • 25 Sep 2019 • Xi Chen, Yuan Gao, Ali Ghadirzadeh, Marten Bjorkman, Ginevra Castellano, Patric Jensfelt
In this work, we introduce an exploration approach based on maximizing the entropy of the visited states while learning a goal-conditioned policy.