Hate Speech Detection

164 papers with code • 14 benchmarks • 39 datasets

Hate speech detection is the task of detecting if communication such as text, audio, and so on contains hatred and or encourages violence towards a person or a group of people. This is usually based on prejudice against 'protected characteristics' such as their ethnicity, gender, sexual orientation, religion, age et al. Some example benchmarks are ETHOS and HateXplain. Models can be evaluated with metrics like the F-score or F-measure.

Benchmarks

Add a Result

These leaderboards are used to track progress in Hate Speech Detection

Dataset	Best Model	Compare
Ethos Binary	BiLSTM + static BE	See all
HateXplain	BERT-MRP	See all
Ethos MultiLabel	MLARAM	See all
Waseem et al., 2018	Mozafari et al., 2019	See all
Automatic Misogynistic Identification	mBert	See all
ToLD-Br	Multilingual BERT	See all
OffensEval 2019	HateBERT	See all
AbusEval	HateBERT	See all
HatEval	HateBERT	See all
Hostility Detection Dataset in Hindi	Auxiliary IndicBert	See all
bajer_danish_misogyny	AOM mBERT	See all
DKhate	Baseline	See all
SHAJ	Baseline BERT (task A)	See all
OLID	RoBERTa-large-ST	See all

Show all 14 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Hate Speech Detection models and implementations

l3cube-pune/MarathiNLP

5 papers

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter

zeerakw/hatespeech • WS 2016

Paper
Code

Measuring the Reliability of Hate Speech Annotations: The Case of the European Refugee Crisis

UCSM-DUE/IWG_hatespeech_public • 27 Jan 2017

Some users of social media are spreading racist, sexist, and otherwise hateful content.

Paper
Code

Deep Learning for Hate Speech Detection in Tweets

pinkeshbadjatiya/twitter-hatespeech • • 1 Jun 2017

Hate speech detection on Twitter is critical for applications like controversial event extraction, building AI chatterbots, content recommendation, and sentiment analysis.

Paper
Code

Surfacing contextual hate speech words within social media

JherezTaylor/hatespeech_codewords • 28 Nov 2017

As an example, "skypes", "googles", and "yahoos" are all instances of words which have an alternate meaning that can be used for hate speech.

Paper
Code

Hate Speech Detection: A Solved Problem? The Challenging Case of Long Tail on Twitter

ziqizhang/chase • 27 Feb 2018

Our methods are evaluated on the largest collection of hate speech datasets based on Twitter, and are shown to be able to outperform the best performing method by up to 5 percentage points in macro-average F1, or 8 percentage points in the more challenging case of identifying hateful content.

Paper
Code