no code implementations • COLING 2022 • Zihao Feng, Hailong Cao, Tiejun Zhao, Weixuan Wang, Wei Peng
Despite their progress in high-resource language settings, unsupervised bilingual lexicon induction (UBLI) models often fail on corpora with low-resource distant language pairs due to insufficient initialization.
no code implementations • 26 May 2021 • Hailong Cao, Tiejun Zhao
Embeddings of two languages are made to match with each other by rotating and scaling.
no code implementations • 16 Mar 2021 • Pengbo Liu, Hailong Cao, Tiejun Zhao
Multi-modal machine translation (MMT) improves translation quality by introducing visual information.
Ranked #5 on Multimodal Machine Translation on Multi30K
no code implementations • 3 Sep 2019 • Xuefeng Bai, Yue Zhang, Hailong Cao, Tiejun Zhao
Unsupervised bilingual lexicon induction naturally exhibits duality, which results from symmetry in back-translation.
no code implementations • COLING 2016 • Hailong Cao, Tiejun Zhao, Shu Zhang, Yao Meng
We introduce a distribution based model to learn bilingual word embeddings from monolingual data.