Search Results for author: Verena Blaschke

Found 9 papers, 5 papers with code

Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data

1 code implementation · 19 Mar 2024 · Siyao Peng, Zihang Sun, Huangyan Shan, Marie Kolm, Verena Blaschke, Ekaterina Artemova, Barbara Plank

Named Entity Recognition (NER) is a fundamental task to extract key information from texts, but annotated resources are scarce for dialects.

Dialect Identification, Multi-Task Learning (+3 more)

MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank

no code implementations · 15 Mar 2024 · Verena Blaschke, Barbara Kovačić, Siyao Peng, Hinrich Schütze, Barbara Plank

Despite the success of the Universal Dependencies (UD) project exemplified by its impressive language breadth, there is still a lack in 'within-language breadth': most treebanks focus on standard languages.

POS, POS Tagging

MaiBaam Annotation Guidelines

no code implementations · 9 Mar 2024 · Verena Blaschke, Barbara Kovačić, Siyao Peng, Barbara Plank

This document provides the annotation guidelines for MaiBaam, a Bavarian corpus annotated with part-of-speech (POS) tags and syntactic dependencies.

POS

Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties

1 code implementation · 3 Feb 2024 · Ekaterina Artemova, Verena Blaschke, Barbara Plank

Inspired by prior work on English varieties, we craft and manually evaluate perturbation rules that transform German sentences into colloquial forms and use them to synthesize test sets in four ToD datasets.

Intent Recognition, Slot Filling (+3 more)

Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages

5 code implementations · 20 Apr 2023 · Verena Blaschke, Hinrich Schütze, Barbara Plank

This can, for instance, be observed when finetuning PLMs on one language and evaluating them on data in a closely related language variety with no standardized orthography.

Cross-Lingual Transfer, Part-Of-Speech Tagging (+2 more)

A Survey of Corpora for Germanic Low-Resource Languages and Dialects

2 code implementations · 19 Apr 2023 · Verena Blaschke, Hinrich Schütze, Barbara Plank

In this work, we instead focus on low-resource languages and in particular non-standardized low-resource languages.
