no code implementations • EMNLP (Eval4NLP) 2020 • Reda Yacouby, Dustin Axman
We present a probabilistic extension of Precision, Recall, and F1 score, which we refer to as confidence-Precision (cPrecision), confidence-Recall (cRecall), and confidence-F1 (cF1) respectively.
no code implementations • 16 Oct 2023 • Dustin Axman, Avik Ray, Shubham Garg, Jing Huang
While dialog response generation has been widely studied on the agent side, it is not evident if similar generative models can be used to generate a large variety of, and often unexpected, user inputs that real dialog systems encounter in practice.