no code implementations • Findings (NAACL) 2022 • Yang Zhao, Hua Qin, Wang Zhenyu, Changxi Zhu, Shihan Wang
It supports evaluating the difficulty of dialogue tasks only using the learning experiences of dialogue policy and skip-level selection according to their learning needs to maximize the learning efficiency.