1 code implementation • 28 Jan 2024 • Longxiang Liu, Xiuxing Li, Yang Feng
Specifically, we model the hierarchical policy as trees and utilize the similarity between trees to sample negative policy based on scheduled sampling, hoping the model to generate invariant responses under perturbations.
no code implementations • 10 Jan 2023 • Zhuosheng Zhang, Hai Zhao, Longxiang Liu
We decouple the contextualized word representations by masking mechanisms in Transformer-based PrLM, making each word only focus on the words in current utterance, other utterances, and two speaker roles (i. e., utterances of sender and utterances of the receiver), respectively.
1 code implementation • 14 Sep 2020 • Longxiang Liu, Zhuosheng Zhang, Hai Zhao, Xi Zhou, Xiang Zhou
A multi-turn dialogue is composed of multiple utterances from two or more different speaker roles.