Producing Standard German Subtitles for Swiss German TV Content

SLPAT (ACL) 2022 · Johanna Gerlach, Jonathan Mutal, Bouillon Pierrette ·

In this study we compare two approaches (neural machine translation and edit-based) and the use of synthetic data for the task of translating normalised Swiss German ASR output into correct written Standard German for subtitles, with a special focus on syntactic differences. Results suggest that NMT is better suited to this task and that relatively simple rule-based generation of training data could be a valuable approach for cases where little training data is available and transformations are simple.

PDF Abstract