1 code implementation • 19 Mar 2024 • Siyao Peng, Zihang Sun, Huangyan Shan, Marie Kolm, Verena Blaschke, Ekaterina Artemova, Barbara Plank
Named Entity Recognition (NER) is a fundamental task to extract key information from texts, but annotated resources are scarce for dialects.
no code implementations • 15 Mar 2024 • Verena Blaschke, Barbara Kovačić, Siyao Peng, Hinrich Schütze, Barbara Plank
Despite the success of the Universal Dependencies (UD) project exemplified by its impressive language breadth, there is still a lack of 'within-language breadth': most treebanks focus on standard languages.
no code implementations • 9 Mar 2024 • Verena Blaschke, Barbara Kovačić, Siyao Peng, Barbara Plank
This document provides the annotation guidelines for MaiBaam, a Bavarian corpus annotated with part-of-speech (POS) tags and syntactic dependencies.
no code implementations • 19 Feb 2024 • Verena Blaschke, Christoph Purschke, Hinrich Schütze, Barbara Plank
Natural language processing (NLP) has largely focused on modelling standardized languages.
1 code implementation • 3 Feb 2024 • Ekaterina Artemova, Verena Blaschke, Barbara Plank
Inspired by prior work on English varieties, we craft and manually evaluate perturbation rules that transform German sentences into colloquial forms and use them to synthesize test sets in four task-oriented dialogue (ToD) datasets.
5 code implementations • 20 Apr 2023 • Verena Blaschke, Hinrich Schütze, Barbara Plank
This can for instance be observed when finetuning PLMs on one language and evaluating them on data in a closely related language variety with no standardized orthography.
2 code implementations • 19 Apr 2023 • Verena Blaschke, Hinrich Schütze, Barbara Plank
In this work, we instead focus on low-resource languages and in particular non-standardized low-resource languages.
1 code implementation • SEMEVAL 2020 • Verena Blaschke, Maxim Korniyenko, Sam Tureski
This paper describes our participation in the SemEval-2020 task Detection of Propaganda Techniques in News Articles.
Propaganda span identification • Propaganda technique identification
no code implementations • COLING 2018 • Çağrı Çöltekin, Taraka Rama, Verena Blaschke
This paper describes our systems for the VarDial 2018 evaluation campaign.