Search Results for author: Josef Ruppenhofer

Found 38 papers, 8 papers with code

Who’s in, who’s out? Predicting the Inclusiveness or Exclusiveness of Personal Pronouns in Parliamentary Debates

no code implementations LREC 2022 Ines Rehbein, Josef Ruppenhofer

This paper presents a compositional annotation scheme to capture the clusivity properties of personal pronouns in context, that is their ability to construct and manage in-groups and out-groups by including/excluding the audience and/or non-speech act participants in reference to groups that also include the speaker.

I’ve got a construction looks funny – representing and recovering non-standard constructions in UD

no code implementations UDW (COLING) 2020 Josef Ruppenhofer, Ines Rehbein

We argue that a unified treatment of constructions across languages will increase the consistency of the UD annotations and thus the quality of the treebanks for linguistic analysis.

Multilingual NLP

Identifying Implicitly Abusive Remarks about Identity Groups using a Linguistically Informed Approach

1 code implementation NAACL 2022 Michael Wiegand, Elisabeth Eder, Josef Ruppenhofer

We address the task of distinguishing implicitly abusive sentences on identity groups (“Muslims contaminate our planet”) from other group-related negative polar sentences (“Muslims despise terrorism”).

Abusive Language

Implicitly Abusive Language -- What does it actually look like and why are we not getting there?

no code implementations NAACL 2021 Michael Wiegand, Josef Ruppenhofer, Elisabeth Eder

Abusive language detection is an emerging field in natural language processing which has received a large amount of attention recently.

Abusive Language Position

Implicitly Abusive Comparisons -- A New Dataset and Linguistic Analysis

1 code implementation EACL 2021 Michael Wiegand, Maja Geulig, Josef Ruppenhofer

We examine the task of detecting implicitly abusive comparisons (e. g. {``}Your hair looks like you have been electrocuted{''}).

Exploiting Emojis for Abusive Language Detection

no code implementations EACL 2021 Michael Wiegand, Josef Ruppenhofer

We propose to use abusive emojis, such as the {``}middle finger{''} or {``}face vomiting{''}, as a proxy for learning a lexicon of abusive words.

Abusive Language domain classification

Treebanking User-Generated Content: a UD Based Overview of Guidelines, Corpora and Unified Recommendations

no code implementations3 Nov 2020 Manuela Sanguinetti, Lauren Cassidy, Cristina Bosco, Özlem Çetinoğlu, Alessandra Teresa Cignarella, Teresa Lynn, Ines Rehbein, Josef Ruppenhofer, Djamé Seddah, Amir Zeldes

This article presents a discussion on the main linguistic phenomena which cause difficulties in the analysis of user-generated texts found on the web and in social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework of syntactic analysis.

A New Resource for German Causal Language

no code implementations LREC 2020 Ines Rehbein, Josef Ruppenhofer

In the paper, we present inter-annotator agreement scores for our dataset and discuss problems for annotating causal language.

Fine-grained Named Entity Annotations for German Biographic Interviews

no code implementations LREC 2020 Josef Ruppenhofer, Ines Rehbein, Carolina Flinz

Building on the OntoNotes 5. 0 NER inventory, our scheme is adapted for a corpus of transcripts of biographic interviews by adding categories for AGE and LAN(guage) and also features extended numeric and temporal categories.

NER

Treebanking User-Generated Content: A Proposal for a Unified Representation in Universal Dependencies

no code implementations LREC 2020 Manuela Sanguinetti, Cristina Bosco, Lauren Cassidy, {\"O}zlem {\c{C}}etino{\u{g}}lu, Aless Cignarella, ra Teresa, Teresa Lynn, Ines Rehbein, Josef Ruppenhofer, Djam{\'e} Seddah, Amir Zeldes

The paper presents a discussion on the main linguistic phenomena of user-generated texts found in web and social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework.

Improving Sentence Boundary Detection for Spoken Language Transcripts

no code implementations LREC 2020 Ines Rehbein, Josef Ruppenhofer, Thomas Schmidt

For the detection of boundaries in spoken language transcripts, we achieve a substantial improvement when framing the boundary detection problem assentence pair classification task, as compared to a sequence tagging approach.

Boundary Detection Sentence +1

Sprucing up the trees -- Error detection in treebanks

no code implementations COLING 2018 Ines Rehbein, Josef Ruppenhofer

We present a method for detecting annotation errors in manually and automatically annotated dependency parse trees, based on ensemble parsing in combination with Bayesian inference, guided by active learning.

Active Learning Bayesian Inference +2

Detecting annotation noise in automatically labelled data

no code implementations ACL 2017 Ines Rehbein, Josef Ruppenhofer

We introduce a method for error detection in automatically annotated text, aimed at supporting the creation of high-quality language resources at affordable cost.

Active Learning Domain Adaptation +2

Catching the Common Cause: Extraction and Annotation of Causal Relations and their Participants

no code implementations WS 2017 Ines Rehbein, Josef Ruppenhofer

In this paper, we present a simple, yet effective method for the automatic identification and extraction of causal relations from text, based on a large English-German parallel corpus.

Language Modelling

Effect Functors for Opinion Inference

no code implementations LREC 2016 Josef Ruppenhofer, Br, Jasper es

We also present results of a crowdsourcing experiment to test the utility of some known and some new functors for opinion inference where, unlike in previous work, subjects are asked to reason from event evaluation to participant evaluation.

Sentiment Analysis

Yes we can!? Annotating English modal verbs

no code implementations LREC 2012 Josef Ruppenhofer, Ines Rehbein

This paper presents an annotation scheme for English modal verbs together with sense-annotated data from the news domain.

Sentiment Analysis Subjectivity Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.