no code implementations • EMNLP 2021 • Yangyang Zhao, Zhenyu Wang, Changxi Zhu, Shihan Wang
Most of the existing dialogue policy methods rely on a single learning system, while the human brain has two specialized learning and memory systems, supporting to find good solutions without requiring copious examples.
no code implementations • Findings (NAACL) 2022 • Yang Zhao, Hua Qin, Wang Zhenyu, Changxi Zhu, Shihan Wang
It supports evaluating the difficulty of dialogue tasks only using the learning experiences of dialogue policy and skip-level selection according to their learning needs to maximize the learning efficiency.
no code implementations • 16 Mar 2022 • Changxi Zhu, Mehdi Dastani, Shihan Wang
Communication is an effective mechanism for coordinating the behavior of multiple agents.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • IJCNLP 2019 • Xingwei Tan, Yi Cai, Changxi Zhu
Aspect-level sentiment classification, which is a fine-grained sentiment analysis task, has received lots of attention these years.