Search Results for author: Mihaela Gaman

Found 9 papers, 5 papers with code

A Report on the VarDial Evaluation Campaign 2020

no code implementations VarDial (COLING) 2020 Mihaela Gaman, Dirk Hovy, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubešić, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri

This paper presents the results of the VarDial Evaluation Campaign 2020 organized as part of the seventh workshop on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects (VarDial), co-located with COLING 2020.

Dialect Identification

FreCDo: A Large Corpus for French Cross-Domain Dialect Identification

1 code implementation15 Dec 2022 Mihaela Gaman, Adrian-Gabriel Chifu, William Domingues, Radu Tudor Ionescu

We present a novel corpus for French dialect identification comprising 413, 522 French text samples collected from public news websites in Belgium, Canada, France and Switzerland.

Dialect Identification

UnibucKernel: Geolocating Swiss German Jodels Using Ensemble Learning

no code implementations EACL (VarDial) 2021 Mihaela Gaman, Sebastian Cojocariu, Radu Tudor Ionescu

In this work, we describe our approach addressing the Social Media Variety Geolocation task featured in the 2021 VarDial Evaluation Campaign.

Dialect Identification Ensemble Learning +1

Clustering Word Embeddings with Self-Organizing Maps. Application on LaRoSeDa -- A Large Romanian Sentiment Data Set

1 code implementation11 Jan 2021 Anca Maria Tache, Mihaela Gaman, Radu Tudor Ionescu

Romanian is one of the understudied languages in computational linguistics, with few resources available for the development of natural language processing tools.

Clustering Sentiment Analysis +3

Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets

no code implementations VarDial (COLING) 2020 Mihaela Gaman, Radu Tudor Ionescu

From simple models for regression, such as Support Vector Regression, to deep neural networks, such as Long Short-Term Memory networks and character-level convolutional neural networks, and, finally, to ensemble models based on meta-learners, such as XGBoost, our interest is focused on approaching the problem from a few different perspectives, in an attempt to minimize the prediction error.

Dialect Identification regression

Automatically Identifying Complaints in Social Media

1 code implementation ACL 2019 Daniel Preotiuc-Pietro, Mihaela Gaman, Nikolaos Aletras

Complaining is a basic speech act regularly used in human and computer mediated communication to express a negative mismatch between reality and expectations in a particular situation.

Cannot find the paper you are looking for? You can Submit a new open access paper.