no code implementations • 12 Feb 2024 • Siyuan Li, Shijie Han, Yingnan Zhao, By Liang, Peng Liu
To achieve automatic auxiliary reward generation, we propose a novel representation learning approach that can measure the ``transition distance'' between states.