no code implementations • IWSLT (ACL) 2022 • Ngoc-Quan Pham, Tuan Nam Nguyen, Thai-Binh Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, Alexander Waibel
Pretrained models in acoustic and textual modalities can potentially improve speech translation for both cascade and end-to-end approaches.
no code implementations • 7 Aug 2023 • Christian Huber, Tu Anh Dinh, Carlos Mullov, Ngoc Quan Pham, Thai Binh Nguyen, Fabian Retkowski, Stefan Constantin, Enes Yavuz Ugan, Danni Liu, Zhaolin Li, Sai Koneru, Jan Niehues, Alexander Waibel
Secondly, we compare different approaches to low-latency speech translation using this framework.
1 code implementation • 8 Jun 2023 • Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues
In this paper, we describe our speech translation system for the multilingual track of IWSLT 2023, which evaluates translation quality on scientific conference talks.
no code implementations • 9 Jun 2022 • Alexander Waibel, Moritz Behr, Fevziye Irem Eyiokur, Dogucan Yaman, Tuan-Nam Nguyen, Carlos Mullov, Mehmet Arif Demirtas, Alperen Kantarcı, Stefan Constantin, Hazim Kemal Ekenel
The system combines multiple component models and produces a video of the original speaker speaking in the target language that is lip-synchronous with the target speech, while preserving the speech emphases, voice characteristics, and face video of the original speaker.
no code implementations • IWSLT (ACL) 2022 • Peter Polák, Ngoc-Quan Pham, Tuan-Nam Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, Ondřej Bojar, Alexander Waibel
In this paper, we describe our submission to the Simultaneous Speech Translation task at IWSLT 2022.
no code implementations • 11 Mar 2021 • Carlos Mullov, Ngoc-Quan Pham, Alexander Waibel
In an attempt to train the mapping from the encoder sentence representation to a new target language, we use our model as an autoencoder.