SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models

SEMEVAL 2020 · Flor Miriam Plaza del Arco, M. Dolores Molina González, Alfonso Ureña-López, Maite Martin

This paper describes the participation of the SINAI team in Task 12: OffensEval 2: Multilingual Offensive Language Identification in Social Media. In particular, we participate in Sub-task A for English, which consists of classifying tweets as offensive or not offensive. We preprocess the dataset to account for the characteristics of language used on social media. Then, we select a small subset of the training set provided by the organizers and fine-tune different Transformer-based models in order to test their effectiveness. Our team ranks 20th out of 85 participants in Sub-task A using the XLNet model.
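The abstract does not list the exact preprocessing steps, so the following is only a minimal sketch of the kind of tweet normalization commonly applied before fine-tuning a Transformer classifier. The function name `normalize_tweet`, the placeholder tokens `URL` and `@USER`, and every rule in it are illustrative assumptions, not the authors' pipeline.

```python
import re

# All rules below are assumptions for illustration; the abstract does not
# specify which normalization steps the SINAI team applied.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
HASHTAG_RE = re.compile(r"#(\w+)")
REPEAT_RE = re.compile(r"(.)\1{2,}")
SPACE_RE = re.compile(r"\s+")


def normalize_tweet(text):
    """Apply a few generic social-media normalization rules to one tweet."""
    text = URL_RE.sub("URL", text)        # replace links with a placeholder
    text = MENTION_RE.sub("@USER", text)  # anonymize user mentions
    text = HASHTAG_RE.sub(r"\1", text)    # keep the hashtag word, drop '#'
    text = REPEAT_RE.sub(r"\1\1", text)   # cap runs of a repeated character at two
    return SPACE_RE.sub(" ", text).strip()  # collapse whitespace


print(normalize_tweet("@bob this is sooooo bad!!! #NotCool http://t.co/x"))
```

Under these assumed rules, the normalized text would then be tokenized and passed to whichever Transformer-based model is being fine-tuned.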
