Search Results for author: Alejandro Moreo

Found 21 papers, 14 papers with code

Quantification using Permutation-Invariant Networks based on Histograms

1 code implementation22 Mar 2024 Olaya Pérez-Mon, Alejandro Moreo, Juan José del Coz, Pablo González

Quantification, also known as class prevalence estimation, is the supervised learning task in which a model is trained to predict the prevalence of each class in a given bag of examples.

Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation

no code implementations17 Mar 2024 Silvia Corbara, Alejandro Moreo

In this paper, we investigate the potential benefits of augmenting the classifier training set with (negative) synthetic examples.

Authorship Verification Data Augmentation +2

Kernel Density Estimation for Multiclass Quantification

no code implementations31 Dec 2023 Alejandro Moreo, Pablo González, Juan José del Coz

Several disciplines, like the social sciences, epidemiology, sentiment analysis, or market research, are interested in knowing the distribution of the classes in a population rather than the individual labels of the members thereof.

Density Estimation Epidemiology +2

Explainable Authorship Identification in Cultural Heritage Applications: Analysis of a New Perspective

no code implementations3 Nov 2023 Mattia Setzu, Silvia Corbara, Anna Monreale, Alejandro Moreo, Fabrizio Sebastiani

While a substantial amount of work has recently been devoted to enhance the performance of computational Authorship Identification (AId) systems, little to no attention has been paid to endowing AId systems with the ability to explain the reasons behind their predictions.

Authorship Attribution Authorship Verification +3

Regularization-Based Methods for Ordinal Quantification

1 code implementation13 Oct 2023 Mirko Bunse, Alejandro Moreo, Fabrizio Sebastiani, Martin Senz

Quantification, i. e., the task of training predictors of the class prevalence values in sets of unlabeled data items, has received increased attention in recent years.

Binary Quantification and Dataset Shift: An Experimental Investigation

1 code implementation6 Oct 2023 Pablo González, Alejandro Moreo, Fabrizio Sebastiani

One finding that results from this investigation is that many existing quantification methods that had been found robust to prior probability shift are not necessarily robust to other types of dataset shift.

Binary Quantification

Same or Different? Diff-Vectors for Authorship Analysis

1 code implementation24 Jan 2023 Silvia Corbara, Alejandro Moreo, Fabrizio Sebastiani

We investigate the effects on authorship identification tasks of a fundamental shift in how to conceive the vectorial representations of documents that are given as input to a supervised learner.

Authorship Attribution Authorship Verification

Multi-Label Quantification

1 code implementation15 Nov 2022 Alejandro Moreo, Manuel Francisco, Fabrizio Sebastiani

While many quantification methods have been proposed in the past for binary problems and, to a lesser extent, single-label multiclass problems, the multi-label setting (i. e., the scenario in which the classes of interest are not mutually exclusive) remains by and large unexplored.

Binary Quantification

LeQua@CLEF2022: Learning to Quantify

no code implementations22 Nov 2021 Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

LeQua 2022 is a new lab for the evaluation of methods for "learning to quantify" in textual datasets, i. e., for training predictors of the relative frequencies of the classes of interest in sets of unlabelled textual documents.

Binary Quantification Multiclass Quantification

Syllabic Quantity Patterns as Rhythmic Features for Latin Authorship Attribution

1 code implementation27 Oct 2021 Silvia Corbara, Alejandro Moreo, Fabrizio Sebastiani

It is well known that, within the Latin production of written text, peculiar metric schemes were followed not only in poetic compositions, but also in many prose works.

Authorship Attribution

Measuring Fairness Under Unawareness of Sensitive Attributes: A Quantification-Based Approach

1 code implementation17 Sep 2021 Alessandro Fabris, Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

More in detail, we show that fairness under unawareness can be cast as a quantification problem and solved with proven methods from the quantification literature.

Fairness

Generalized Funnelling: Ensemble Learning and Heterogeneous Document Embeddings for Cross-Lingual Text Classification

no code implementations17 Sep 2021 Alejandro Moreo, Andrea Pedrotti, Fabrizio Sebastiani

In this ensemble method, 1st-tier classifiers, each working on a different and language-dependent feature space, return a vector of calibrated posterior probabilities (with one dimension for each class) for each document, and the final classification decision is taken by a metaclassifier that uses this vector as its input.

Ensemble Learning Multilabel Text Classification +3

QuaPy: A Python-Based Framework for Quantification

1 code implementation18 Jun 2021 Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani

prevalence values) of the classes of interest in a sample of unlabelled data.

Model Selection

Re-Assessing the "Classify and Count" Quantification Method

1 code implementation4 Nov 2020 Alejandro Moreo, Fabrizio Sebastiani

This task originated with the observation that "Classify and Count" (CC), the trivial method of obtaining class prevalence estimates, is often a biased estimator, and thus delivers suboptimal quantification accuracy; following this observation, several methods for learning to quantify have been proposed that have been shown to outperform CC.

General Classification Sentiment Analysis +1

Tweet Sentiment Quantification: An Experimental Re-Evaluation

1 code implementation4 Nov 2020 Alejandro Moreo, Fabrizio Sebastiani

It is well-known that solving quantification by means of ``classify and count'' (i. e., by classifying all unlabelled items by means of a standard classifier and counting the items that have been assigned to a given class) is less than optimal in terms of accuracy, and that more accurate quantification methods exist.

Sentiment Analysis Sentiment Classification

MedLatinEpi and MedLatinLit: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts

no code implementations22 Jun 2020 Silvia Corbara, Alejandro Moreo, Fabrizio Sebastiani, Mirko Tavoni

We present and make available MedLatinEpi and MedLatinLit, two datasets of medieval Latin texts to be used in research on computational authorship analysis.

Authorship Attribution Authorship Verification

Word-Class Embeddings for Multiclass Text Classification

2 code implementations26 Nov 2019 Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani

Pre-trained word embeddings encode general word semantics and lexical regularities of natural language, and have proven useful across many NLP tasks, including word sense disambiguation, machine translation, and sentiment analysis, to name a few.

General Classification Machine Translation +6

Cross-Lingual Sentiment Quantification

3 code implementations16 Apr 2019 Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

Cross-lingual sentiment quantification (and cross-lingual \emph{text} quantification in general) has never been discussed before in the literature; we establish baseline results for the binary case by combining state-of-the-art quantification methods with methods capable of generating cross-lingual vectorial representations of the source and target documents involved.

Cross-Lingual Sentiment Classification General Classification +2

Building Automated Survey Coders via Interactive Machine Learning

no code implementations28 Mar 2019 Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

We will show that, for the same amount of training effort, interactive learning delivers much better coding accuracy than standard "non-interactive" learning.

BIG-bench Machine Learning

Funnelling: A New Ensemble Method for Heterogeneous Transfer Learning and its Application to Cross-Lingual Text Classification

1 code implementation31 Jan 2019 Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani

Funnelling consists of generating a two-tier classification system where all documents, irrespectively of language, are classified by the same (2nd-tier) classifier.

Ensemble Learning General Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.