Search Results for author: Sanjika Hewavitharana

Found 8 papers, 2 papers with code

ITEm: Unsupervised Image-Text Embedding Learning for eCommerce

no code implementations22 Oct 2023 Baohao Liao, Michael Kozielski, Sanjika Hewavitharana, Jiangbo Yuan, Shahram Khadivi, Tomer Lancewicki

How to teach a model to learn embedding from different modalities without neglecting information from the less dominant modality is challenging.

Mask More and Mask Later: Efficient Pre-training of Masked Language Models by Disentangling the [MASK] Token

1 code implementation9 Nov 2022 Baohao Liao, David Thulke, Sanjika Hewavitharana, Hermann Ney, Christof Monz

We show: (1) [MASK]s can indeed be appended at a later layer, being disentangled from the word embedding; (2) The gathering of contextualized information from unmasked tokens can be conducted with a few layers.

Back-translation for Large-Scale Multilingual Machine Translation

1 code implementation WMT (EMNLP) 2021 Baohao Liao, Shahram Khadivi, Sanjika Hewavitharana

Surprisingly, the smaller size of vocabularies perform better, and the extensive monolingual English data offers a modest improvement.

Machine Translation Translation

Towards Semantic Query Segmentation

no code implementations25 Jul 2017 Ajinkya Kale, Thrivikrama Taula, Sanjika Hewavitharana, Amit Srivastava

Query Segmentation is one of the critical components for understanding users' search intent in Information Retrieval tasks.

Information Retrieval Retrieval +1

Cannot find the paper you are looking for? You can Submit a new open access paper.