1 code implementation • ACL 2022 • Ruolan Yang, Zitong Li, Haifeng Tang, Kenny Zhu
Existing automatic evaluation systems for chatbots mostly rely on static chat scripts as ground truth, which is hard to obtain, and require access to the bots' models as a form of “white-box testing”.