Abusive Language

44 papers with code • 0 benchmarks • 9 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

HateMonitors: Language Agnostic Abuse Detection in Social Media

punyajoy/HateMonitors-HASOC 27 Sep 2019

In this paper, we present our machine learning model, HateMonitor, developed for Hate Speech and Offensive Content Identification in Indo-European Languages (HASOC), a shared task at FIRE 2019.

Demographics Should Not Be the Reason of Toxicity: Mitigating Discrimination in Text Classifications with Instance Weighting

ghzhang233/Non-Discrimination-Learning-for-Text-Classification ACL 2020

In this paper, we formalize the unintended biases in text classification datasets as a kind of selection bias from the non-discrimination distribution to the discrimination distribution.

Aggression and Misogyny Detection using BERT: A Multi-Task Approach

NiloofarSafi/TRAC-2 LREC 2020

In recent times, the focus of the NLP community has increased towards offensive language, aggression, and hate-speech detection. This paper presents our system for TRAC-2 shared task on {``}Aggression Identification{''} (sub-task A) and {``}Misogynistic Aggression Identification{''} (sub-task B).

Intersectional Bias in Hate Speech and Abusive Language Datasets

jaeyk/intersectional-bias-in-ml 12 May 2020

Algorithms are widely applied to detect hate speech and abusive language in social media.

Examining Racial Bias in an Online Abuse Corpus with Structural Topic Modeling

db758/icwsm_data_challenge 26 May 2020

We then use structural topic modeling to examine the content of the tweets and how the prevalence of different topics is related to both abusiveness annotation and dialect prediction.

Detect All Abuse! Toward Universal Abusive Language Detection Models

usydnlp/MACAS COLING 2020

Online abusive language detection (ALD) has become a societal issue of increasing importance in recent years.

HateBERT: Retraining BERT for Abusive Language Detection in English

tommasoc80/HateBERT ACL (WOAH) 2021

In this paper, we introduce HateBERT, a re-trained BERT model for abusive language detection in English.

"Nice Try, Kiddo": Investigating Ad Hominems in Dialogue Responses

ewsheng/ad-hom-in-dialogue 24 Oct 2020

Ad hominem attacks are those that target some feature of a person's character instead of the position the person is maintaining.

A study of text representations in Hate Speech Detection

cthem/hate-speech-detection 8 Feb 2021

The pervasiveness of the Internet and social media have enabled the rapid and anonymous spread of Hate Speech content on microblogging platforms such as Twitter.

Abuse is Contextual, What about NLP? The Role of Context in Abusive Language Annotation and Detection

dhfbk/twitter-abusive-context-dataset 27 Mar 2021

We first re-annotate part of a widely used dataset for abusive language detection in English in two conditions, i. e. with and without context.