no code implementations • WMT (EMNLP) 2020 • Haijiang Wu, Zixuan Wang, Qingsong Ma, Xinjie Wen, Ruichen Wang, Xiaoli Wang, Yulin Zhang, Zhipeng Yao, Siyao Peng
This paper presents Tencent’s submission to the WMT20 Quality Estimation (QE) Shared Task: Sentence-Level Post-editing Effort for English-Chinese in Task 2.
no code implementations • 20 Apr 2023 • Mingjun Zhao, Mengzhen Wang, Yinglong Ma, Di Niu, Haijiang Wu
To address this issue, we propose CEIL, a novel Classification-Enhanced Iterative Learning framework for short text clustering, which aims at generally promoting the clustering performance by introducing a classification objective to iteratively improve feature representations.
1 code implementation • 31 May 2021 • Mingjun Zhao, Haijiang Wu, Di Niu, Zixuan Wang, Xiaoli Wang
Verdi adopts two word predictors to enable diverse features to be extracted from a pair of sentences for subsequent quality estimation, including a transformer-based neural machine translation (NMT) model and a pre-trained cross-lingual language model (XLM).
no code implementations • 13 Apr 2020 • Mingjun Zhao, Haijiang Wu, Di Niu, Xiaoli Wang
Specifically, we propose a data selection framework based on Deterministic Actor-Critic, in which a critic network predicts the expected change of model performance due to a certain sample, while an actor network learns to select the best sample out of a random batch of samples presented to it.