Search Results for author: Prashant Mathur

Found 24 papers, 9 papers with code

GFST: Gender-Filtered Self-Training for More Accurate Gender in Translation

1 code implementation • EMNLP 2021 • Prafulla Kumar Choubey, Anna Currey, Prashant Mathur, Georgiana Dinu

Targeted evaluations have found that machine translation systems often output incorrect gender in translations, even when the gender is clear from context.

Machine Translation Translation

Paper
Code

Findings of the IWSLT 2022 Evaluation Campaign

no code implementations • IWSLT (ACL) 2022 • Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vĕra Kloudová, Surafel Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nǎdejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe

The evaluation campaign of the 19th International Conference on Spoken Language Translation featured eight shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Speech to speech translation, (iv) Low-resource speech translation, (v) Multilingual speech translation, (vi) Dialect speech translation, (vii) Formality control for speech translation, (viii) Isometric speech translation.

Speech-to-Speech Translation Translation

Paper
Add Code

Distilling Multiple Domains for Neural Machine Translation

no code implementations • EMNLP 2020 • Anna Currey, Prashant Mathur, Georgiana Dinu

Neural machine translation achieves impressive results in high-resource conditions, but performance often suffers when the input domain is low-resource.

Machine Translation Translation

Paper
Add Code

PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores

1 code implementation • 10 Apr 2024 • Lucas Goncalves, Prashant Mathur, Chandrashekhar Lavania, Metehan Cekic, Marcello Federico, Kyu J. Han

Recent advancements in audio-visual generative modeling have been propelled by progress in deep learning and the availability of data-rich benchmarks.

Audio-Visual Synchronization

Paper
Code

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation

1 code implementation • 1 Nov 2023 • Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico

Conventional speech-to-text translation (ST) systems are trained on single-speaker utterances, and they may not generalize to real-life scenarios where the audio contains conversations by multiple speakers.

Automatic Speech Recognition speech-recognition +3

Paper
Code

Improving Isochronous Machine Translation with Target Factors and Auxiliary Counters

no code implementations • 22 May 2023 • Proyag Pal, Brian Thompson, Yogesh Virkar, Prashant Mathur, Alexandra Chronopoulou, Marcello Federico

To translate speech for automatic dubbing, machine translation needs to be isochronous, i. e. translated speech needs to be aligned with the source in terms of speech durations.

Machine Translation Translation

Paper
Add Code

Jointly Optimizing Translations and Speech Timing to Improve Isochrony in Automatic Dubbing

1 code implementation • 25 Feb 2023 • Alexandra Chronopoulou, Brian Thompson, Prashant Mathur, Yogesh Virkar, Surafel M. Lakew, Marcello Federico

Automatic dubbing (AD) is the task of translating the original speech in a video into target language speech.

Translation

Paper
Code

Improving Robustness of Retrieval Augmented Translation via Shuffling of Suggestions

no code implementations • 11 Oct 2022 • Cuong Hoang, Devendra Sachan, Prashant Mathur, Brian Thompson, Marcello Federico

Several recent studies have reported dramatic performance improvements in neural machine translation (NMT) by augmenting translation at inference time with fuzzy-matches retrieved from a translation memory (TM).

Machine Translation NMT +2

Paper
Add Code

Improving Retrieval Augmented Neural Machine Translation by Controlling Source and Fuzzy-Match Interactions

no code implementations • 10 Oct 2022 • Cuong Hoang, Devendra Sachan, Prashant Mathur, Brian Thompson, Marcello Federico

We explore zero-shot adaptation, where a general-domain model has access to customer or domain specific parallel data at inference time, but not during training.

Machine Translation Retrieval +2

Paper
Add Code

Automatic Evaluation and Analysis of Idioms in Neural Machine Translation

1 code implementation • 10 Oct 2022 • Christos Baziotis, Prashant Mathur, Eva Hasler

A major open problem in neural machine translation (NMT) is the translation of idiomatic expressions, such as "under the weather".

Machine Translation NMT +1

Paper
Code

Embarrassingly Easy Document-Level MT Metrics: How to Convert Any Pretrained Metric Into a Document-Level Metric

1 code implementation • 27 Sep 2022 • Giorgos Vernikos, Brian Thompson, Prashant Mathur, Marcello Federico

Our experimental results support our initial hypothesis and show that a simple extension of the metrics permits them to take advantage of context to resolve ambiguities in the reference.

Machine Translation Sentence

Paper
Code

Sockeye 3: Fast Neural Machine Translation with PyTorch

2 code implementations • 12 Jul 2022 • Felix Hieber, Michael Denkowski, Tobias Domhan, Barbara Darques Barros, Celina Dong Ye, Xing Niu, Cuong Hoang, Ke Tran, Benjamin Hsu, Maria Nadejde, Surafel Lakew, Prashant Mathur, Anna Currey, Marcello Federico

When running comparable models, Sockeye 3 is up to 126% faster than other PyTorch implementations on GPUs and up to 292% faster on CPUs.

Machine Translation NMT +1

1,207

Paper
Code

Isometric MT: Neural Machine Translation for Automatic Dubbing

no code implementations • 16 Dec 2021 • Surafel M. Lakew, Yogesh Virkar, Prashant Mathur, Marcello Federico

Automatic dubbing (AD) is among the machine translation (MT) use cases where translations should match a given length to allow for synchronicity between source and target speech.

Machine Translation Re-Ranking +2

Paper
Add Code

Isochrony-Aware Neural Machine Translation for Automatic Dubbing

no code implementations • 16 Dec 2021 • Derek Tam, Surafel M. Lakew, Yogesh Virkar, Prashant Mathur, Marcello Federico

We introduce the task of isochrony-aware machine translation which aims at generating translations suitable for dubbing.

Machine Translation Sentence +1

Paper
Add Code

Faithful Target Attribute Prediction in Neural Machine Translation

1 code implementation • 24 Sep 2021 • Xing Niu, Georgiana Dinu, Prashant Mathur, Anna Currey

The training data used in NMT is rarely controlled with respect to specific attributes, such as word casing or gender, which can cause errors in translations.

Attribute Data Augmentation +4

Paper
Code

Improving Gender Translation Accuracy with Filtered Self-Training

no code implementations • 15 Apr 2021 • Prafulla Kumar Choubey, Anna Currey, Prashant Mathur, Georgiana Dinu

Targeted evaluations have found that machine translation systems often output incorrect gender, even when the gender is clear from context.

Machine Translation Sentence +1

Paper
Add Code

Evaluating Robustness to Input Perturbations for Neural Machine Translation

no code implementations • ACL 2020 • Xing Niu, Prashant Mathur, Georgiana Dinu, Yaser Al-Onaizan

Neural Machine Translation (NMT) models are sensitive to small perturbations in the input.

Machine Translation NMT +1

Paper
Add Code

Joint translation and unit conversion for end-to-end localization

no code implementations • WS 2020 • Georgiana Dinu, Prashant Mathur, Marcello Federico, Stanislas Lauly, Yaser Al-Onaizan

A variety of natural language tasks require processing of textual data which contains a mix of natural language and formal languages such as mathematical expressions.

Data Augmentation Translation