no code implementations • 22 Oct 2023 • Baohao Liao, Michael Kozielski, Sanjika Hewavitharana, Jiangbo Yuan, Shahram Khadivi, Tomer Lancewicki
Teaching a model to learn embeddings from different modalities without neglecting information from the less dominant modality is challenging.
1 code implementation • 9 Nov 2022 • Baohao Liao, David Thulke, Sanjika Hewavitharana, Hermann Ney, Christof Monz
We show: (1) [MASK]s can indeed be appended at a later layer, disentangled from the word embedding; (2) contextualized information from unmasked tokens can be gathered within a few layers.
1 code implementation • WMT (EMNLP) 2021 • Baohao Liao, Shahram Khadivi, Sanjika Hewavitharana
Surprisingly, smaller vocabularies perform better, and the extensive monolingual English data offers only a modest improvement.
no code implementations • IWSLT (EMNLP) 2018 • Shen Yan, Leonard Dahlmann, Pavel Petrushkov, Sanjika Hewavitharana, Shahram Khadivi
Pre-training a model with word weights improves fine-tuning by up to 1.24% BLEU absolute and 1.64% TER, respectively.
no code implementations • 25 Jul 2017 • Ajinkya Kale, Thrivikrama Taula, Sanjika Hewavitharana, Amit Srivastava
Query Segmentation is one of the critical components for understanding users' search intent in Information Retrieval tasks.