no code implementations • EAMT 2022 • Mattia Di Gangi, Nick Rossenbach, Alejandro Pérez, Parnia Bahar, Eugen Beck, Patrick Wilken, Evgeny Matusov
The revoicing usually comes with a changed script, mostly in a different language, and the revoicing should reproduce the original emotions, coherent with the body language, and lip synchronized.
no code implementations • IWSLT 2016 • Jan-Thorsten Peter, Andreas Guta, Nick Rossenbach, Miguel Graça, Hermann Ney
This work describes the statistical machine translation (SMT) systems of RWTH Aachen University developed for the evaluation campaign of International Workshop on Spoken Language Translation (IWSLT) 2016.
no code implementations • IWSLT 2017 • Parnia Bahar, Jan Rosendahl, Nick Rossenbach, Hermann Ney
This work describes the Neural Machine Translation (NMT) system of the RWTH Aachen University developed for the English$German tracks of the evaluation campaign of the International Workshop on Spoken Language Translation (IWSLT) 2017.
no code implementations • 12 Oct 2023 • Nick Rossenbach, Benedikt Hilmes, Ralf Schlüter
Synthetic data generated by text-to-speech (TTS) systems can be used to improve automatic speech recognition (ASR) systems in low-resource or domain mismatch tasks.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • 6 Jun 2023 • Parnia Bahar, Mattia Di Gangi, Nick Rossenbach, Mohammad Zeineldeen
Automatic Arabic diacritization is useful in many applications, ranging from reading support for language learners to accurate pronunciation predictor for downstream tasks like speech synthesis.
no code implementations • 12 Apr 2021 • Nick Rossenbach, Mohammad Zeineldeen, Benedikt Hilmes, Ralf Schlüter, Hermann Ney
We achieve a final word-error-rate of 3. 3%/10. 0% with a hybrid system on the clean/noisy test-sets, surpassing any previous state-of-the-art systems on Librispeech-100h that do not include unlabeled audio data.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 19 Dec 2019 • Nick Rossenbach, Albert Zeyer, Ralf Schlüter, Hermann Ney
We achieve improvements of up to 33% relative in word-error-rate (WER) over a strong baseline with data-augmentation in a low-resource environment (LibriSpeech-100h), closing the gap to a comparable oracle experiment by more than 50\%.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • WS 2019 • Yunsu Kim, Hendrik Rosendahl, Nick Rossenbach, Jan Rosendahl, Shahram Khadivi, Hermann Ney
We propose a novel model architecture and training algorithm to learn bilingual sentence embeddings from a combination of parallel and monolingual data.
no code implementations • WS 2018 • Nick Rossenbach, Jan Rosendahl, Yunsu Kim, Miguel Gra{\c{c}}a, Aman Gokrani, Hermann Ney
We use several rule-based, heuristic methods to preselect sentence pairs.