Search Results for author: Cristina Bosco

Found 25 papers, 2 papers with code

Italian NLP for Everyone: Resources and Models from EVALITA to the European Language Grid

no code implementations LREC 2022 Valerio Basile, Cristina Bosco, Michael Fell, Viviana Patti, Rossella Varvara

The European Language Grid enables researchers and practitioners to easily distribute and use NLP resources and models, such as corpora and classifiers.

Do Dependency Relations Help in the Task of Stance Detection?

no code implementations insights (ACL) 2022 Alessandra Teresa Cignarella, Cristina Bosco, Paolo Rosso

Furthermore, we study the phenomenon of stance with respect to six different targets – one per language, and two different for Italian – employing a variety of machine learning algorithms that primarily exploit morphological and syntactic knowledge as features, represented throughout the format of Universal Dependencies.

Stance Detection

Multilingual Irony Detection with Dependency Syntax and Neural Models

1 code implementation COLING 2020 Alessandra Teresa Cignarella, Valerio Basile, Manuela Sanguinetti, Cristina Bosco, Paolo Rosso, Farah Benamara

This paper presents an in-depth investigation of the effectiveness of dependency-based syntactic features on the irony detection task in a multilingual perspective (English, Spanish, French and Italian).

Word Embeddings

Treebanking User-Generated Content: a UD Based Overview of Guidelines, Corpora and Unified Recommendations

no code implementations3 Nov 2020 Manuela Sanguinetti, Lauren Cassidy, Cristina Bosco, Özlem Çetinoğlu, Alessandra Teresa Cignarella, Teresa Lynn, Ines Rehbein, Josef Ruppenhofer, Djamé Seddah, Amir Zeldes

This article presents a discussion on the main linguistic phenomena which cause difficulties in the analysis of user-generated texts found on the web and in social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework of syntactic analysis.

Treebanking User-Generated Content: A Proposal for a Unified Representation in Universal Dependencies

no code implementations LREC 2020 Manuela Sanguinetti, Cristina Bosco, Lauren Cassidy, {\"O}zlem {\c{C}}etino{\u{g}}lu, Aless Cignarella, ra Teresa, Teresa Lynn, Ines Rehbein, Josef Ruppenhofer, Djam{\'e} Seddah, Amir Zeldes

The paper presents a discussion on the main linguistic phenomena of user-generated texts found in web and social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework.

Marking Irony Activators in a Universal Dependencies Treebank: The Case of an Italian Twitter Corpus

no code implementations LREC 2020 Aless Cignarella, ra Teresa, Manuela Sanguinetti, Cristina Bosco, Paolo Rosso

In this paper we describe a fine-grained annotation scheme centered on irony, in which we highlight the tokens that are responsible for its activation, (irony activators) and their morpho-syntactic features.

Sentiment Analysis

SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter

no code implementations SEMEVAL 2019 Valerio Basile, Cristina Bosco, Elisabetta Fersini, Debora Nozza, Viviana Patti, Francisco Manuel Rangel Pardo, Paolo Rosso, Manuela Sanguinetti

The paper describes the organization of the SemEval 2019 Task 5 about the detection of hate speech against immigrants and women in Spanish and English messages extracted from Twitter.

Exploring the Impact of Pragmatic Phenomena on Irony Detection in Tweets: A Multilingual Corpus Study

no code implementations EACL 2017 Jihen Karoui, Farah Benamara, V{\'e}ronique Moriceau, Viviana Patti, Cristina Bosco, Nathalie Aussenac-Gilles

This paper provides a linguistic and pragmatic analysis of the phenomenon of irony in order to represent how Twitter{'}s users exploit irony devices within their communication strategies for generating textual contents.

Sentiment Analysis

Tweeting and Being Ironic in the Debate about a Political Reform: the French Annotated Corpus TWitter-MariagePourTous

no code implementations LREC 2016 Cristina Bosco, Mirko Lai, Viviana Patti, Daniela Virone

The annotation process is presented and the disagreement discussed, in particular, in the perspective of figurative language use and in that of the semantic oriented annotation, which are open challenges for NLP systems.

Sentiment Analysis

Annotating Sentiment and Irony in the Online Italian Political Debate on \#labuonascuola

no code implementations LREC 2016 Marco Stranisci, Cristina Bosco, Delia Iraz{\'u} Hern{\'a}ndez Far{\'\i}as, Viviana Patti

In this paper we present the TWitterBuonaScuola corpus (TW-BS), a novel Italian linguistic resource for Sentiment Analysis, developed with the main aim of analyzing the online debate on the controversial Italian political reform {``}Buona Scuola{''} (Good school), aimed at reorganizing the national educational and training systems.

Sentiment Analysis

Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies

no code implementations LREC 2014 Maria Simi, Cristina Bosco, Simonetta Montemagni

This is done by comparing the performance of a statistical parser (DeSR) trained on a simpler resource (the augmented version of the Merged Italian Dependency Treebank or MIDT+) and whose output was automatically converted to SD, with the results of the parser directly trained on ISDT.

TAG

Exploiting catenae in a parallel treebank alignment

no code implementations LREC 2014 Manuela Sanguinetti, Cristina Bosco, Loredana Cupi

This paper aims to introduce the issues related to the syntactic alignment of a dependency-based multilingual parallel treebank, ParTUT.

Translation

A treebank-based study on the influence of Italian word order on parsing performance

no code implementations LREC 2012 Anita Alicante, Cristina Bosco, Anna Corazza, Alberto Lavelli

The aim of this paper is to contribute to the debate on the issues raised by Morphologically Rich Languages, and more precisely to investigate, in a cross-paradigm perspective, the influence of the constituent order on the data-driven parsing of one of such languages(i. e. Italian).

Constituency Parsing Dependency Parsing

The Parallel-TUT: a multilingual and multiformat treebank

no code implementations LREC 2012 Cristina Bosco, Manuela Sanguinetti, Leonardo Lesmo

The paper introduces an ongoing project for the development of a parallel treebank for Italian, English and French, i. e.

Machine Translation

Cannot find the paper you are looking for? You can Submit a new open access paper.