1 code implementation • EMNLP 2021 • Prafulla Kumar Choubey, Anna Currey, Prashant Mathur, Georgiana Dinu
Targeted evaluations have found that machine translation systems often output incorrect gender in translations, even when the gender is clear from context.
no code implementations • IWSLT (ACL) 2022 • Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vĕra Kloudová, Surafel Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nǎdejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe
The evaluation campaign of the 19th International Conference on Spoken Language Translation featured eight shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Speech to speech translation, (iv) Low-resource speech translation, (v) Multilingual speech translation, (vi) Dialect speech translation, (vii) Formality control for speech translation, (viii) Isometric speech translation.
no code implementations • EMNLP 2020 • Anna Currey, Prashant Mathur, Georgiana Dinu
Neural machine translation achieves impressive results in high-resource conditions, but performance often suffers when the input domain is low-resource.
1 code implementation • 10 Apr 2024 • Lucas Goncalves, Prashant Mathur, Chandrashekhar Lavania, Metehan Cekic, Marcello Federico, Kyu J. Han
Recent advancements in audio-visual generative modeling have been propelled by progress in deep learning and the availability of data-rich benchmarks.
1 code implementation • 1 Nov 2023 • Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico
Conventional speech-to-text translation (ST) systems are trained on single-speaker utterances, and they may not generalize to real-life scenarios where the audio contains conversations by multiple speakers.
no code implementations • 22 May 2023 • Proyag Pal, Brian Thompson, Yogesh Virkar, Prashant Mathur, Alexandra Chronopoulou, Marcello Federico
To translate speech for automatic dubbing, machine translation needs to be isochronous, i. e. translated speech needs to be aligned with the source in terms of speech durations.
1 code implementation • 25 Feb 2023 • Alexandra Chronopoulou, Brian Thompson, Prashant Mathur, Yogesh Virkar, Surafel M. Lakew, Marcello Federico
Automatic dubbing (AD) is the task of translating the original speech in a video into target language speech.
no code implementations • 11 Oct 2022 • Cuong Hoang, Devendra Sachan, Prashant Mathur, Brian Thompson, Marcello Federico
Several recent studies have reported dramatic performance improvements in neural machine translation (NMT) by augmenting translation at inference time with fuzzy-matches retrieved from a translation memory (TM).
no code implementations • 10 Oct 2022 • Cuong Hoang, Devendra Sachan, Prashant Mathur, Brian Thompson, Marcello Federico
We explore zero-shot adaptation, where a general-domain model has access to customer or domain specific parallel data at inference time, but not during training.
1 code implementation • 10 Oct 2022 • Christos Baziotis, Prashant Mathur, Eva Hasler
A major open problem in neural machine translation (NMT) is the translation of idiomatic expressions, such as "under the weather".
1 code implementation • 27 Sep 2022 • Giorgos Vernikos, Brian Thompson, Prashant Mathur, Marcello Federico
Our experimental results support our initial hypothesis and show that a simple extension of the metrics permits them to take advantage of context to resolve ambiguities in the reference.
2 code implementations • 12 Jul 2022 • Felix Hieber, Michael Denkowski, Tobias Domhan, Barbara Darques Barros, Celina Dong Ye, Xing Niu, Cuong Hoang, Ke Tran, Benjamin Hsu, Maria Nadejde, Surafel Lakew, Prashant Mathur, Anna Currey, Marcello Federico
When running comparable models, Sockeye 3 is up to 126% faster than other PyTorch implementations on GPUs and up to 292% faster on CPUs.
no code implementations • 16 Dec 2021 • Surafel M. Lakew, Yogesh Virkar, Prashant Mathur, Marcello Federico
Automatic dubbing (AD) is among the machine translation (MT) use cases where translations should match a given length to allow for synchronicity between source and target speech.
no code implementations • 16 Dec 2021 • Derek Tam, Surafel M. Lakew, Yogesh Virkar, Prashant Mathur, Marcello Federico
We introduce the task of isochrony-aware machine translation which aims at generating translations suitable for dubbing.
1 code implementation • 24 Sep 2021 • Xing Niu, Georgiana Dinu, Prashant Mathur, Anna Currey
The training data used in NMT is rarely controlled with respect to specific attributes, such as word casing or gender, which can cause errors in translations.
no code implementations • 15 Apr 2021 • Prafulla Kumar Choubey, Anna Currey, Prashant Mathur, Georgiana Dinu
Targeted evaluations have found that machine translation systems often output incorrect gender, even when the gender is clear from context.
no code implementations • ACL 2020 • Xing Niu, Prashant Mathur, Georgiana Dinu, Yaser Al-Onaizan
Neural Machine Translation (NMT) models are sensitive to small perturbations in the input.
no code implementations • WS 2020 • Georgiana Dinu, Prashant Mathur, Marcello Federico, Stanislas Lauly, Yaser Al-Onaizan
A variety of natural language tasks require processing of textual data which contains a mix of natural language and formal languages such as mathematical expressions.
1 code implementation • ACL 2019 • Georgiana Dinu, Prashant Mathur, Marcello Federico, Yaser Al-Onaizan
This paper proposes a novel method to inject custom terminology into neural machine translation at run time.
no code implementations • WS 2018 • Jos{\'e} G. Camargo de Souza, Michael Kozielski, Prashant Mathur, Ernie Chang, Marco Guerini, Matteo Negri, Marco Turchi, Evgeny Matusov
The setting requires the generation process to be fast and the generated title to be both human-readable and concise.
no code implementations • NAACL 2018 • Prashant Mathur, Nicola Ueffing, Gregor Leusch
These browse pages require a title describing the content of the page.
no code implementations • WS 2017 • Prashant Mathur, Nicola Ueffing, Gregor Leusch
We present two approaches to generate titles for browse pages in five different languages, namely English, German, French, Italian and Spanish.