no code implementations • 27 Mar 2024 • Andrea Esuli, Giovanni Puccetti
This results in a lower number of available benchmarks to evaluate the performance of language models in Italian.
1 code implementation • 9 Mar 2023 • Davide Alessandro Coccomini, Andrea Esuli, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato
This paper explores the task of detecting images generated by text-to-image diffusion models.
no code implementations • 2 Aug 2022 • Barbara Berti, Andrea Esuli, Fabrizio Sebastiani
We focus on a different facet of the NLI task, i. e., that of analysing the internals of an NLI classifier trained by an \emph{explainable} machine learning algorithm, in order to obtain explanations of its classification decisions, with the ultimate goal of gaining insight into which linguistic phenomena ``give a speaker's native language away''.
1 code implementation • IEEE Access 2022 • Andrea Esuli
The application uses machine learning to continuously fit automatic classification models that are in turn used to actively support its users with classification suggestions.
2 code implementations • 21 Jun 2022 • Nicola Messina, Davide Alessandro Coccomini, Andrea Esuli, Fabrizio Falchi
With the increased accessibility of web and online encyclopedias, the amount of data to manage is constantly increasing.
no code implementations • 22 Nov 2021 • Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani
LeQua 2022 is a new lab for the evaluation of methods for "learning to quantify" in textual datasets, i. e., for training predictors of the relative frequencies of the classes of interest in sets of unlabelled textual documents.
1 code implementation • 17 Sep 2021 • Alessandro Fabris, Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani
More in detail, we show that fairness under unawareness can be cast as a quantification problem and solved with proven methods from the quantification literature.
1 code implementation • 18 Jun 2021 • Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani
prevalence values) of the classes of interest in a sample of unlabelled data.
1 code implementation • 12 Aug 2020 • Nicola Messina, Giuseppe Amato, Andrea Esuli, Fabrizio Falchi, Claudio Gennaro, Stéphane Marchand-Maillet
In this work, we tackle the task of cross-modal retrieval through image-sentence matching based on word-region alignments, using supervision only at the global image-sentence level.
Ranked #6 on Image Retrieval on Flickr30K 1K test
1 code implementation • 20 Apr 2020 • Nicola Messina, Fabrizio Falchi, Andrea Esuli, Giuseppe Amato
State-of-the-art results in image-text matching are achieved by inter-playing image and text features from the two different processing pipelines, usually using mutual attention mechanisms.
2 code implementations • 26 Nov 2019 • Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani
Pre-trained word embeddings encode general word semantics and lexical regularities of natural language, and have proven useful across many NLP tasks, including word sense disambiguation, machine translation, and sentiment analysis, to name a few.
3 code implementations • 16 Apr 2019 • Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani
Cross-lingual sentiment quantification (and cross-lingual \emph{text} quantification in general) has never been discussed before in the literature; we establish baseline results for the binary case by combining state-of-the-art quantification methods with methods capable of generating cross-lingual vectorial representations of the source and target documents involved.
Cross-Lingual Sentiment Classification General Classification +2
no code implementations • 28 Mar 2019 • Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani
We will show that, for the same amount of training effort, interactive learning delivers much better coding accuracy than standard "non-interactive" learning.
1 code implementation • 28 Mar 2019 • Alejandro Moreo Fernández, Andrea Esuli, Fabrizio Sebastiani
In information retrieval (IR) and related tasks, term weighting approaches typically consider the frequency of the term in the document and in the collection in order to compute a score reflecting the importance of the term for the document.
1 code implementation • 31 Jan 2019 • Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani
Funnelling consists of generating a two-tier classification system where all documents, irrespectively of language, are classified by the same (2nd-tier) classifier.
1 code implementation • 19 Oct 2018 • Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani
This paper introduces PyDCI, a new implementation of Distributional Correspondence Indexing (DCI) written in Python.
Ranked #2 on Sentiment Analysis on Multi-Domain Sentiment Dataset
1 code implementation • 4 Sep 2018 • Andrea Esuli, Alejandro Moreo Fernández, Fabrizio Sebastiani
Quantification is a supervised learning task that consists in predicting, given a set of classes C and a set D of unlabelled items, the prevalence (or relative frequency) p(c|D) of each class c in C. Quantification can in principle be solved by classifying all the unlabelled items and counting how many of them have been attributed to each class.
no code implementations • 21 Jun 2017 • Andrea Esuli, Tiziano Fagni, Alejandro Moreo Fernandez
JaTeCS is an open source Java library that supports research on automatic text categorization and other related problems, such as ordinal regression and quantification, which are of special interest in opinion mining applications.
no code implementations • 20 Apr 2017 • Fabio Carrara, Andrea Esuli, Fabrizio Falchi, Alejandro Moreo Fernández
The recently proposed stochastic residual networks selectively activate or bypass the layers during training, based on independent stochastic choices, each of which following a probability distribution that is fixed in advance.
2 code implementations • 23 Jun 2016 • Fabio Carrara, Andrea Esuli, Tiziano Fagni, Fabrizio Falchi, Alejandro Moreo Fernández
We choose to implement the actual search process as a similarity search in a visual feature space, by learning to translate a textual query into a visual representation.
no code implementations • 2 Mar 2015 • Giacomo Berardi, Andrea Esuli, Fabrizio Sebastiani
\emph{Semi-Automated Text Classification} (SATC) may be defined as the task of ranking a set $\mathcal{D}$ of automatically labelled textual documents in such a way that, if a human annotator validates (i. e., inspects and corrects where appropriate) the documents in a top-ranked portion of $\mathcal{D}$ with the goal of increasing the overall labelling accuracy of $\mathcal{D}$, the expected increase is maximized.
no code implementations • 19 Feb 2015 • Andrea Esuli, Fabrizio Sebastiani
We address the problem of \emph{quantification}, a supervised learning task whose goal is, given a class, to estimate the relative frequency (or \emph{prevalence}) of the class in a dataset of unlabelled items.
no code implementations • 6 Jun 2013 • Andrea Esuli
With the release of SentiWordNet 3. 0 the related Web interface has been restyled and improved in order to allow users to submit feedback on the SentiWordNet entries, in the form of the suggestion of alternative triplets of values for an entry.