no code implementations • VarDial (COLING) 2020 • Mihaela Gaman, Dirk Hovy, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubešić, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri
This paper presents the results of the VarDial Evaluation Campaign 2020 organized as part of the seventh workshop on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects (VarDial), co-located with COLING 2020.
1 code implementation • VarDial (COLING) 2022 • Noëmi Aepli, Antonios Anastasopoulos, Adrian-Gabriel Chifu, William Domingues, Fahim Faisal, Mihaela Gaman, Radu Tudor Ionescu, Yves Scherrer
This report presents the results of the shared tasks organized as part of the VarDial Evaluation Campaign 2022.
1 code implementation • 15 Dec 2022 • Mihaela Gaman, Adrian-Gabriel Chifu, William Domingues, Radu Tudor Ionescu
We present a novel corpus for French dialect identification comprising 413, 522 French text samples collected from public news websites in Belgium, Canada, France and Switzerland.
no code implementations • 28 Jan 2022 • Mihaela Gaman, Lida Ghadamiyan, Radu Tudor Ionescu, Marius Popescu
An important preliminary step of optical character recognition systems is the detection of text rows.
1 code implementation • ACL 2021 • Ana-Cristina Rogoz, Mihaela Gaman, Radu Tudor Ionescu
In this work, we introduce a corpus for satire detection in Romanian news.
no code implementations • EACL (VarDial) 2021 • Mihaela Gaman, Sebastian Cojocariu, Radu Tudor Ionescu
In this work, we describe our approach addressing the Social Media Variety Geolocation task featured in the 2021 VarDial Evaluation Campaign.
1 code implementation • 11 Jan 2021 • Anca Maria Tache, Mihaela Gaman, Radu Tudor Ionescu
Romanian is one of the understudied languages in computational linguistics, with few resources available for the development of natural language processing tools.
no code implementations • VarDial (COLING) 2020 • Mihaela Gaman, Radu Tudor Ionescu
From simple models for regression, such as Support Vector Regression, to deep neural networks, such as Long Short-Term Memory networks and character-level convolutional neural networks, and, finally, to ensemble models based on meta-learners, such as XGBoost, our interest is focused on approaching the problem from a few different perspectives, in an attempt to minimize the prediction error.
1 code implementation • ACL 2019 • Daniel Preotiuc-Pietro, Mihaela Gaman, Nikolaos Aletras
Complaining is a basic speech act regularly used in human and computer mediated communication to express a negative mismatch between reality and expectations in a particular situation.