no code implementations • IWSLT (ACL) 2022 • Frithjof Petrick, Jan Rosendahl, Christian Herold, Hermann Ney
After its introduction, the Transformer architecture quickly became the gold standard for the task of neural machine translation.
no code implementations • EMNLP (insights) 2021 • Jan Rosendahl, Christian Herold, Frithjof Petrick, Hermann Ney
In this work, we conduct a comprehensive investigation of one of the centerpieces of modern machine translation systems: the encoder-decoder attention mechanism.
no code implementations • 18 Oct 2023 • Frithjof Petrick, Christian Herold, Pavel Petrushkov, Shahram Khadivi, Hermann Ney
Finally, we explore language model fusion in the light of recent advancements in large language models.