Search Results for author: Giulia Venturi

Found 24 papers, 1 papers with code

On the Nature of BERT: Correlating Fine-Tuning and Linguistic Competence

no code implementations COLING 2022 Federica Merendi, Felice Dell’Orletta, Giulia Venturi

Several studies in the literature on the interpretation of Neural Language Models (NLM) focus on the linguistic generalization abilities of pre-trained models.

Making Italian Parliamentary Records Machine-Actionable: the Construction of the ParlaMint-IT corpus

no code implementations ParlaCLARIN (LREC) 2022 Tommaso Agnoloni, Roberto Bartolini, Francesca Frontini, Simonetta Montemagni, Carlo Marchetti, Valeria Quochi, Manuela Ruisi, Giulia Venturi

The corpus contains 1199 sessions and 79, 373 speeches, for a total of about 31 million words and was encoded according to the ParlaCLARIN TEI XML format, as well as in CoNLL-UD format.

POS

SemEval-2022 Task 3: PreTENS-Evaluating Neural Networks on Presuppositional Semantic Knowledge

no code implementations SemEval (NAACL) 2022 Roberto Zamparelli, Shammur Chowdhury, Dominique Brunato, Cristiano Chesi, Felice Dell’Orletta, Md. Arid Hasan, Giulia Venturi

We report the results of the SemEval 2022 Task 3, PreTENS, on evaluation the acceptability of simple sentences containing constructions whose two arguments are presupposed to be or not to be in an ordered taxonomic relation.

Data Augmentation

What Makes My Model Perplexed? A Linguistic Investigation on Neural Language Models Perplexity

no code implementations NAACL (DeeLIO) 2021 Alessio Miaschi, Dominique Brunato, Felice Dell’Orletta, Giulia Venturi

This paper presents an investigation aimed at studying how the linguistic structure of a sentence affects the perplexity of two of the most popular Neural Language Models (NLMs), BERT and GPT-2.

Sentence

Linguistic Knowledge Can Enhance Encoder-Decoder Models (If You Let It)

1 code implementation27 Feb 2024 Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi

In this paper, we explore the impact of augmenting pre-trained Encoder-Decoder models, specifically T5, with linguistic knowledge for the prediction of a target task.

Sentence

Linguistic Profiling of a Neural Language Model

no code implementations COLING 2020 Alessio Miaschi, Dominique Brunato, Felice Dell'Orletta, Giulia Venturi

In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems.

Language Modelling Sentence

Tracking the Evolution of Written Language Competence in L2 Spanish Learners

no code implementations WS 2020 Alessio Miaschi, Sam Davidson, Dominique Brunato, Felice Dell{'}Orletta, Kenji Sagae, Claudia Helena Sanchez-Gutierrez, Giulia Venturi

In this paper we present an NLP-based approach for tracking the evolution of written language competence in L2 Spanish learners using a wide range of linguistic features automatically extracted from students{'} written productions.

Profiling-UD: a Tool for Linguistic Profiling of Texts

no code implementations LREC 2020 Dominique Brunato, Andrea Cimino, Felice Dell{'}Orletta, Giulia Venturi, Simonetta Montemagni

In this paper, we introduce Profiling{--}UD, a new text analysis tool inspired to the principles of linguistic profiling that can support language variation research from different perspectives.

Assessing the Impact of Incremental Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank

no code implementations WS 2018 Chiara Alzetta, Felice Dell{'}Orletta, Simonetta Montemagni, Maria Simi, Giulia Venturi

For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, and at the level of individual dependencies.

Dependency Parsing

CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence

no code implementations LREC 2016 Alessia Barbagli, Pietro Lucisano, Felice Dell{'}Orletta, Simonetta Montemagni, Giulia Venturi

In this paper, we present the CItA corpus (Corpus Italiano di Apprendenti L1), a collection of essays written by Italian L1 learners collected during the first and second year of lower secondary school.

T2K\textasciicircum2: a System for Automatically Extracting and Organizing Knowledge from Texts

no code implementations LREC 2014 Felice Dell{'}Orletta, Giulia Venturi, Andrea Cimino, Simonetta Montemagni

In this paper, we present T2K{\textasciicircum}2, a suite of tools for automatically extracting domain―specific knowledge from collections of Italian and English texts.

Enriching the ISST-TANL Corpus with Semantic Frames

no code implementations LREC 2012 Aless Lenci, ro, Simonetta Montemagni, Giulia Venturi, Maria Grazia Cutrull{\`a}

The paper describes the design and the results of a manual annotation methodology devoted to enrich the ISST--TANL Corpus, derived from the Italian Syntactic--Semantic Treebank (ISST), with Semantic Frames information.

Cannot find the paper you are looking for? You can Submit a new open access paper.