no code implementations • 1 Dec 2021 • Linhao Li, Ming Xu, Yongfeng Dong, Xin Li, Ao Wang
Therefore, we propose to group instead of ranking the hypotheses and design a structural loss called ``joint softmax focal loss'' in this paper.
Language Modelling Natural Language Inference