Multiple Captions Embellished Multilingual Multi-Modal Neural Machine Translation

MMTLRL (RANLP) 2021 · Salam Michael Singh, Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay ·

Neural machine translation based on bilingual text with limited training data suffers from lexical diversity, which lowers the rare word translation accuracy and reduces the generalizability of the translation system. In this work, we utilise the multiple captions from the Multi-30K dataset to increase the lexical diversity aided with the cross-lingual transfer of information among the languages in a multilingual setup. In this multilingual and multimodal setting, the inclusion of the visual features boosts the translation quality by a significant margin. Empirical study affirms that our proposed multimodal approach achieves substantial gain in terms of the automatic score and shows robustness in handling the rare word translation in the pretext of English to/from Hindi and Telugu translation tasks.

PDF Abstract