Hate Speech Detection

164 papers with code • 14 benchmarks • 39 datasets

Hate speech detection is the task of detecting if communication such as text, audio, and so on contains hatred and or encourages violence towards a person or a group of people. This is usually based on prejudice against 'protected characteristics' such as their ethnicity, gender, sexual orientation, religion, age et al. Some example benchmarks are ETHOS and HateXplain. Models can be evaluated with metrics like the F-score or F-measure.

Benchmarks

Add a Result

These leaderboards are used to track progress in Hate Speech Detection

Dataset	Best Model	Compare
Ethos Binary	BiLSTM + static BE	See all
HateXplain	BERT-MRP	See all
Ethos MultiLabel	MLARAM	See all
Waseem et al., 2018	Mozafari et al., 2019	See all
Automatic Misogynistic Identification	mBert	See all
ToLD-Br	Multilingual BERT	See all
OffensEval 2019	HateBERT	See all
AbusEval	HateBERT	See all
HatEval	HateBERT	See all
Hostility Detection Dataset in Hindi	Auxiliary IndicBert	See all
bajer_danish_misogyny	AOM mBERT	See all
DKhate	Baseline	See all
SHAJ	Baseline BERT (task A)	See all
OLID	RoBERTa-large-ST	See all

Show all 14 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Hate Speech Detection models and implementations

l3cube-pune/MarathiNLP

5 papers

Datasets

Subtasks

Latest papers

Most implemented Social Latest No code

TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis

virallab/turkishbertweet • • 29 Nov 2023

Wide us of this language on social media platforms such as Twitter, Instagram, or Tiktok and strategic position of the country in the world politics makes it appealing for the social network researchers and industry.

29 Nov 2023

Paper
Code

Improving Cross-Domain Hate Speech Generalizability with Emotion Knowledge

sy-hong/ek-hs-generalizability • • 24 Nov 2023

Reliable automatic hate speech (HS) detection systems must adapt to the in-flow of diverse new data to curtail hate speech.

24 Nov 2023

Paper
Code

Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study

maikezuefle/latent-feature-splits • • 16 Nov 2023

We challenge hate speech models via new train-test splits of existing datasets that rely on the clustering of models' hidden representations.

16 Nov 2023

Paper
Code

GPT-4V(ision) as A Social Media Analysis Engine

vista-h/gpt-4v_social_media • 13 Nov 2023

Our investigation begins with a preliminary quantitative analysis for each task using existing benchmark datasets, followed by a careful review of the results and a selection of qualitative samples that illustrate GPT-4V's potential in understanding multimodal social media content.

13 Nov 2023

Paper
Code

Automatic Textual Normalization for Hate Speech Detection

anhhoang0529/small-lexnormvihsd • 12 Nov 2023

Our dataset is accessible for research purposes.

12 Nov 2023

Paper
Code

mahaNLP: A Marathi Natural Language Processing Library

l3cube-pune/MarathiNLP • 5 Nov 2023

We present mahaNLP, an open-source natural language processing (NLP) library specifically built for the Marathi language.

05 Nov 2023

Paper
Code

HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning

joonkeekim/hare-hate-speech • • 1 Nov 2023

With the proliferation of social media, accurate detection of hate speech has become critical to ensure safety online.

01 Nov 2023

Paper
Code

K-HATERS: A Hate Speech Detection Corpus in Korean with Target-Specific Ratings

ssu-humane/k-haters • • 24 Oct 2023

This resource is the largest offensive language corpus in Korean and is the first to offer target-specific ratings on a three-point Likert scale, enabling the detection of hate expressions in Korean across varying degrees of offensiveness.

24 Oct 2023

Paper
Code

InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations

dfki-nlp/interrolang • • 9 Oct 2023

While recently developed NLP explainability methods let us open the black box in various ways (Madsen et al., 2022), a missing ingredient in this endeavor is an interactive tool offering a conversational interface.

09 Oct 2023

Paper
Code

KoMultiText: Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services

dasol-choi/komultitext • • 6 Oct 2023

With the growth of online services, the need for advanced text classification algorithms, such as sentiment analysis and biased text detection, has become increasingly evident.

06 Oct 2023

Paper
Code

Hate Speech Detection

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result