Search Results for author: Andrea Esuli

Found 25 papers, 14 papers with code

The Invalsi Benchmark: measuring Language Models Mathematical and Language understanding in Italian

no code implementations27 Mar 2024 Andrea Esuli, Giovanni Puccetti

This results in a lower number of available benchmarks to evaluate the performance of language models in Italian.

Language Modelling

Detecting Images Generated by Diffusers

1 code implementation9 Mar 2023 Davide Alessandro Coccomini, Andrea Esuli, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato

This paper explores the task of detecting images generated by text-to-image diffusion models.

Unravelling Interlanguage Facts via Explainable Machine Learning

no code implementations2 Aug 2022 Barbara Berti, Andrea Esuli, Fabrizio Sebastiani

We focus on a different facet of the NLI task, i. e., that of analysing the internals of an NLI classifier trained by an \emph{explainable} machine learning algorithm, in order to obtain explanations of its classification decisions, with the ultimate goal of gaining insight into which linguistic phenomena ``give a speaker's native language away''.

BIG-bench Machine Learning Native Language Identification

ICS: Total Freedom in Manual Text Classification Supported by Unobtrusive Machine Learning

1 code implementation IEEE Access 2022 Andrea Esuli

The application uses machine learning to continuously fit automatic classification models that are in turn used to actively support its users with classification suggestions.

Active Learning text-classification +1

Transformer-Based Multi-modal Proposal and Re-Rank for Wikipedia Image-Caption Matching

2 code implementations21 Jun 2022 Nicola Messina, Davide Alessandro Coccomini, Andrea Esuli, Fabrizio Falchi

With the increased accessibility of web and online encyclopedias, the amount of data to manage is constantly increasing.

LeQua@CLEF2022: Learning to Quantify

no code implementations22 Nov 2021 Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

LeQua 2022 is a new lab for the evaluation of methods for "learning to quantify" in textual datasets, i. e., for training predictors of the relative frequencies of the classes of interest in sets of unlabelled textual documents.

Binary Quantification Multiclass Quantification

Measuring Fairness Under Unawareness of Sensitive Attributes: A Quantification-Based Approach

1 code implementation17 Sep 2021 Alessandro Fabris, Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

More in detail, we show that fairness under unawareness can be cast as a quantification problem and solved with proven methods from the quantification literature.

Fairness

QuaPy: A Python-Based Framework for Quantification

1 code implementation18 Jun 2021 Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani

prevalence values) of the classes of interest in a sample of unlabelled data.

Model Selection

Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders

1 code implementation12 Aug 2020 Nicola Messina, Giuseppe Amato, Andrea Esuli, Fabrizio Falchi, Claudio Gennaro, Stéphane Marchand-Maillet

In this work, we tackle the task of cross-modal retrieval through image-sentence matching based on word-region alignments, using supervision only at the global image-sentence level.

Cross-Modal Retrieval Image Retrieval +3

Transformer Reasoning Network for Image-Text Matching and Retrieval

1 code implementation20 Apr 2020 Nicola Messina, Fabrizio Falchi, Andrea Esuli, Giuseppe Amato

State-of-the-art results in image-text matching are achieved by inter-playing image and text features from the two different processing pipelines, usually using mutual attention mechanisms.

Image Retrieval Image-text matching +3

Word-Class Embeddings for Multiclass Text Classification

2 code implementations26 Nov 2019 Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani

Pre-trained word embeddings encode general word semantics and lexical regularities of natural language, and have proven useful across many NLP tasks, including word sense disambiguation, machine translation, and sentiment analysis, to name a few.

General Classification Machine Translation +6

Cross-Lingual Sentiment Quantification

3 code implementations16 Apr 2019 Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

Cross-lingual sentiment quantification (and cross-lingual \emph{text} quantification in general) has never been discussed before in the literature; we establish baseline results for the binary case by combining state-of-the-art quantification methods with methods capable of generating cross-lingual vectorial representations of the source and target documents involved.

Cross-Lingual Sentiment Classification General Classification +2

Building Automated Survey Coders via Interactive Machine Learning

no code implementations28 Mar 2019 Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

We will show that, for the same amount of training effort, interactive learning delivers much better coding accuracy than standard "non-interactive" learning.

BIG-bench Machine Learning

Learning to Weight for Text Classification

1 code implementation28 Mar 2019 Alejandro Moreo Fernández, Andrea Esuli, Fabrizio Sebastiani

In information retrieval (IR) and related tasks, term weighting approaches typically consider the frequency of the term in the document and in the collection in order to compute a score reflecting the importance of the term for the document.

General Classification Information Retrieval +3

Funnelling: A New Ensemble Method for Heterogeneous Transfer Learning and its Application to Cross-Lingual Text Classification

1 code implementation31 Jan 2019 Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

Funnelling consists of generating a two-tier classification system where all documents, irrespectively of language, are classified by the same (2nd-tier) classifier.

Ensemble Learning General Classification +3

A Recurrent Neural Network for Sentiment Quantification

1 code implementation4 Sep 2018 Andrea Esuli, Alejandro Moreo Fernández, Fabrizio Sebastiani

Quantification is a supervised learning task that consists in predicting, given a set of classes C and a set D of unlabelled items, the prevalence (or relative frequency) p(c|D) of each class c in C. Quantification can in principle be solved by classifying all the unlabelled items and counting how many of them have been attributed to each class.

JaTeCS an open-source JAva TExt Categorization System

no code implementations21 Jun 2017 Andrea Esuli, Tiziano Fagni, Alejandro Moreo Fernandez

JaTeCS is an open source Java library that supports research on automatic text categorization and other related problems, such as ordinal regression and quantification, which are of special interest in opinion mining applications.

feature selection Opinion Mining +1

Exploring epoch-dependent stochastic residual networks

no code implementations20 Apr 2017 Fabio Carrara, Andrea Esuli, Fabrizio Falchi, Alejandro Moreo Fernández

The recently proposed stochastic residual networks selectively activate or bypass the layers during training, based on independent stochastic choices, each of which following a probability distribution that is fixed in advance.

Management

Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions

2 code implementations23 Jun 2016 Fabio Carrara, Andrea Esuli, Tiziano Fagni, Fabrizio Falchi, Alejandro Moreo Fernández

We choose to implement the actual search process as a similarity search in a visual feature space, by learning to translate a textual query into a visual representation.

Cross-Modal Retrieval Descriptive +2

Utility-Theoretic Ranking for Semi-Automated Text Classification

no code implementations2 Mar 2015 Giacomo Berardi, Andrea Esuli, Fabrizio Sebastiani

\emph{Semi-Automated Text Classification} (SATC) may be defined as the task of ranking a set $\mathcal{D}$ of automatically labelled textual documents in such a way that, if a human annotator validates (i. e., inspects and corrects where appropriate) the documents in a top-ranked portion of $\mathcal{D}$ with the goal of increasing the overall labelling accuracy of $\mathcal{D}$, the expected increase is maximized.

General Classification text-classification +1

Optimizing Text Quantifiers for Multivariate Loss Functions

no code implementations19 Feb 2015 Andrea Esuli, Fabrizio Sebastiani

We address the problem of \emph{quantification}, a supervised learning task whose goal is, given a class, to estimate the relative frequency (or \emph{prevalence}) of the class in a dataset of unlabelled items.

Structured Prediction

The User Feedback on SentiWordNet

no code implementations6 Jun 2013 Andrea Esuli

With the release of SentiWordNet 3. 0 the related Web interface has been restyled and improved in order to allow users to submit feedback on the SentiWordNet entries, in the form of the suggestion of alternative triplets of values for an entry.

Cannot find the paper you are looking for? You can Submit a new open access paper.