no code implementations • 3 Dec 2022 • Yanjiang Guo, Jingyue Gao, Zheng Wu, Chengming Shi, Jianyu Chen
In this paper, we consider the case where the target task is mismatched from but similar with that of the expert.
reinforcement-learning Reinforcement Learning (RL) +1