no code implementations • LREC 2022 • Julian Linke, Philip N. Garner, Gernot Kubin, Barbara Schuppler
Conversational speech represents one of the most complex of automatic speech recognition (ASR) tasks owing to the high inter-speaker variation in both pronunciation and conversational dynamics.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 16 Jan 2023 • Julian Linke, Saskia Wepner, Gernot Kubin, Barbara Schuppler
In order to deal with having only limited resources available for conversational German and, at the same time, with a large variation among speakers with respect to pronunciation characteristics, we improve a Kaldi-based ASR system by incorporating a (large) knowledge-based pronunciation lexicon, while exploring different data-based methods to restrict the number of pronunciation variants for each lexical entry.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2