Search Results for author: Mihaela Gaman

Found 9 papers, 5 papers with code

A Report on the VarDial Evaluation Campaign 2020

no code implementations • VarDial (COLING) 2020 • Mihaela Gaman, Dirk Hovy, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubešić, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri

This paper presents the results of the VarDial Evaluation Campaign 2020 organized as part of the seventh workshop on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects (VarDial), co-located with COLING 2020.

Dialect Identification

Paper
Add Code

Findings of the VarDial Evaluation Campaign 2022

1 code implementation • VarDial (COLING) 2022 • Noëmi Aepli, Antonios Anastasopoulos, Adrian-Gabriel Chifu, William Domingues, Fahim Faisal, Mihaela Gaman, Radu Tudor Ionescu, Yves Scherrer

This report presents the results of the shared tasks organized as part of the VarDial Evaluation Campaign 2022.

Dialect Identification Extractive Question-Answering +1

Paper
Code

FreCDo: A Large Corpus for French Cross-Domain Dialect Identification

1 code implementation • 15 Dec 2022 • Mihaela Gaman, Adrian-Gabriel Chifu, William Domingues, Radu Tudor Ionescu

We present a novel corpus for French dialect identification comprising 413, 522 French text samples collected from public news websites in Belgium, Canada, France and Switzerland.

Dialect Identification

Paper
Code

Self-paced learning to improve text row detection in historical documents with missing labels

no code implementations • 28 Jan 2022 • Mihaela Gaman, Lida Ghadamiyan, Radu Tudor Ionescu, Marius Popescu

An important preliminary step of optical character recognition systems is the detection of text rows.

Missing Labels Optical Character Recognition +1

Paper
Add Code

SaRoCo: Detecting Satire in a Novel Romanian Corpus of News Articles

1 code implementation • ACL 2021 • Ana-Cristina Rogoz, Mihaela Gaman, Radu Tudor Ionescu

In this work, we introduce a corpus for satire detection in Romanian news.

Satire Detection

Paper
Code

UnibucKernel: Geolocating Swiss German Jodels Using Ensemble Learning

no code implementations • EACL (VarDial) 2021 • Mihaela Gaman, Sebastian Cojocariu, Radu Tudor Ionescu

In this work, we describe our approach addressing the Social Media Variety Geolocation task featured in the 2021 VarDial Evaluation Campaign.

Dialect Identification Ensemble Learning +1

Paper
Add Code

Clustering Word Embeddings with Self-Organizing Maps. Application on LaRoSeDa -- A Large Romanian Sentiment Data Set

1 code implementation • 11 Jan 2021 • Anca Maria Tache, Mihaela Gaman, Radu Tudor Ionescu

Romanian is one of the understudied languages in computational linguistics, with few resources available for the development of natural language processing tools.

Clustering Sentiment Analysis +3

Paper
Code

Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets

no code implementations • VarDial (COLING) 2020 • Mihaela Gaman, Radu Tudor Ionescu

From simple models for regression, such as Support Vector Regression, to deep neural networks, such as Long Short-Term Memory networks and character-level convolutional neural networks, and, finally, to ensemble models based on meta-learners, such as XGBoost, our interest is focused on approaching the problem from a few different perspectives, in an attempt to minimize the prediction error.

Dialect Identification regression

Paper
Add Code

Automatically Identifying Complaints in Social Media

1 code implementation • ACL 2019 • Daniel Preotiuc-Pietro, Mihaela Gaman, Nikolaos Aletras

Complaining is a basic speech act regularly used in human and computer mediated communication to express a negative mismatch between reality and expectations in a particular situation.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.