Speech Emotion Recognition

98 papers with code • 14 benchmarks • 18 datasets

Speech Emotion Recognition is a task of speech processing and computational paralinguistics that aims to recognize and categorize the emotions expressed in spoken language. The goal is to determine a speaker's emotional state, such as happiness, anger, sadness, or frustration, from speech cues like prosody, pitch, and rhythm.
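
A typical pipeline extracts acoustic features from each utterance and feeds them to a classifier. Below is a minimal sketch of that idea, assuming librosa and scikit-learn are available; the file names and labels are hypothetical placeholders, not a real corpus.

```python
# Minimal sketch of a classical SER pipeline: summarize prosodic and spectral
# trajectories per utterance, then train a simple classifier on the statistics.
import numpy as np
import librosa                      # assumed available for feature extraction
from sklearn.svm import SVC

def utterance_features(path, sr=16000):
    """Fixed-length vector of pitch, energy, and MFCC statistics."""
    y, sr = librosa.load(path, sr=sr)
    f0 = librosa.yin(y, fmin=65, fmax=400, sr=sr)            # frame-level pitch (prosody)
    rms = librosa.feature.rms(y=y)[0]                        # frame-level energy
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13)       # spectral envelope
    stats = [f0.mean(), f0.std(), rms.mean(), rms.std()]
    stats += mfcc.mean(axis=1).tolist() + mfcc.std(axis=1).tolist()
    return np.array(stats)

# Hypothetical file list and labels; a real setup would use a labeled emotion corpus.
train_files = ["happy_001.wav", "angry_001.wav"]
train_labels = ["happiness", "anger"]

X = np.stack([utterance_features(f) for f in train_files])
clf = SVC(kernel="rbf").fit(X, train_labels)
print(clf.predict([utterance_features("test_001.wav")]))     # hypothetical test file
```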

For multimodal emotion recognition, please upload your results to the Multimodal Emotion Recognition on IEMOCAP benchmark.

Most implemented papers

SERAB: A multi-lingual benchmark for speech emotion recognition

neclow/serab 7 Oct 2021

To facilitate the process, here, we present the Speech Emotion Recognition Adaptation Benchmark (SERAB), a framework for evaluating the performance and generalization capacity of different approaches for utterance-level SER.

Speech Emotion Diarization: Which Emotion Appears When?

speechbrain/speechbrain 22 Jun 2023

Speech Emotion Recognition (SER) typically relies on utterance-level solutions.
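
As a point of contrast with the frame-level diarization the paper proposes, here is a rough sketch of the standard utterance-level approach using SpeechBrain's pretrained wav2vec 2.0 IEMOCAP emotion classifier. The foreign_class interface, model identifier, and returned tuple follow that model's Hugging Face card and may differ across SpeechBrain releases; the input file name is a placeholder.

```python
# Rough sketch of conventional utterance-level SER with a pretrained SpeechBrain
# classifier (interface details taken from the model card and may vary by version;
# older releases import foreign_class from speechbrain.pretrained.interfaces).
from speechbrain.inference.interfaces import foreign_class

classifier = foreign_class(
    source="speechbrain/emotion-recognition-wav2vec2-IEMOCAP",
    pymodule_file="custom_interface.py",
    classname="CustomEncoderWav2vec2Classifier",
)

# One label per utterance, which is the granularity that emotion diarization refines.
out_prob, score, index, text_lab = classifier.classify_file("utterance.wav")  # placeholder path
print(text_lab)
```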

emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

ddlBoJack/emotion2vec 23 Dec 2023

To the best of our knowledge, emotion2vec is the first universal representation model in various emotion-related tasks, filling a gap in the field.

nEMO: Dataset of Emotional Speech in Polish

amu-cai/nemo 9 Apr 2024

Speech emotion recognition has become increasingly important in recent years due to its potential applications in healthcare, customer service, and personalization of dialogue systems.

Transfer Learning for Improving Speech Emotion Classification Accuracy

raulsteleac/Speech_Emotion_Recognition 19 Jan 2018

The majority of existing speech emotion recognition research focuses on automatic emotion detection using training and testing data from the same corpus, collected under the same conditions.

Attention Based Fully Convolutional Network for Speech Emotion Recognition

aris-ai/Audio-and-text-based-emotion-recognition 5 Jun 2018

In this paper, we present a novel attention-based fully convolutional network for speech emotion recognition.
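
The core idea, attention pooling over convolutional feature maps so that emotionally salient frames dominate the utterance representation, can be sketched in a few lines of PyTorch. This is an illustrative toy model under assumed input shapes, not the paper's exact architecture.

```python
# Toy sketch: convolutions over a spectrogram, then attention pooling over time
# so that emotionally salient frames contribute more to the utterance embedding.
import torch
import torch.nn as nn

class AttentiveConvSER(nn.Module):
    def __init__(self, n_mels=64, n_classes=4, channels=32):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, channels, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1), nn.ReLU(),
        )
        self.attn = nn.Linear(channels * n_mels, 1)           # one score per frame
        self.head = nn.Linear(channels * n_mels, n_classes)

    def forward(self, spec):                                  # spec: (batch, 1, n_mels, time)
        h = self.conv(spec)                                   # (batch, C, n_mels, time)
        h = h.flatten(1, 2).transpose(1, 2)                   # (batch, time, C * n_mels)
        w = torch.softmax(self.attn(h), dim=1)                # attention weights over time
        pooled = (w * h).sum(dim=1)                           # weighted sum over frames
        return self.head(pooled)                              # class logits

logits = AttentiveConvSER()(torch.randn(2, 1, 64, 120))       # dummy log-mel batch
print(logits.shape)                                           # torch.Size([2, 4])
```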

Evaluating Gammatone Frequency Cepstral Coefficients with Neural Networks for Emotion Recognition from Speech

SoyBison/gammatone 23 Jun 2018

Mel Frequency Cepstral Coefficients (MFCCs) are one of the most commonly used representations for audio speech recognition and classification.
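
For context, MFCCs follow a filterbank, log, DCT recipe; GFCCs keep the same recipe but swap the mel filterbank for a gammatone filterbank. Here is a small sketch of the MFCC side, assuming librosa and SciPy are available (librosa has no built-in gammatone/GFCC front end, so that variant needs a separate filterbank implementation).

```python
# MFCCs spelled out as filterbank -> log -> DCT, checked against librosa's built-in.
import numpy as np
import librosa
from scipy.fftpack import dct

sr = 16000
y = librosa.chirp(fmin=200, fmax=2000, sr=sr, duration=1.0)       # synthetic test signal
mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=40)       # mel filterbank energies
log_mel = librosa.power_to_db(mel)                                # log compression
mfcc_manual = dct(log_mel, axis=0, norm="ortho")[:13]             # keep first 13 cepstra
mfcc_builtin = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13, n_mels=40)
print(np.allclose(mfcc_manual, mfcc_builtin))                     # expected: True
```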

The Emotional Voices Database: Towards Controlling the Emotion Dimension in Voice Generation Systems

numediart/EmoV-DB 25 Jun 2018

In this paper, we present a database of emotional speech intended to be open-sourced and used for synthesis and generation purposes.

Integrating Recurrence Dynamics for Speech Emotion Recognition

etzinis/nldrp 9 Nov 2018

We investigate the performance of features that can capture nonlinear recurrence dynamics embedded in the speech signal for the task of Speech Emotion Recognition (SER).
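
The kind of recurrence structure these features build on can be illustrated with a time-delay embedding and a binary recurrence matrix. The embedding dimension, delay, and radius below are arbitrary illustrative choices, and the sine wave stands in for a voiced speech frame.

```python
# Sketch: recurrence plot of a 1-D signal via time-delay embedding. RQA features
# (recurrence rate, determinism, ...) are then computed from this binary matrix.
import numpy as np

def recurrence_matrix(x, dim=3, delay=5, radius=0.2):
    """Binary recurrence matrix of signal x with illustrative embedding parameters."""
    n = len(x) - (dim - 1) * delay
    emb = np.stack([x[i * delay : i * delay + n] for i in range(dim)], axis=1)  # (n, dim)
    dists = np.linalg.norm(emb[:, None, :] - emb[None, :, :], axis=-1)          # pairwise distances
    return (dists <= radius * dists.max()).astype(np.uint8)

frame = np.sin(2 * np.pi * 5 * np.linspace(0, 1, 400))    # stand-in for a voiced frame
R = recurrence_matrix(frame)
print(R.shape, R.mean())                                  # R.mean() is the recurrence rate
```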

Cross Lingual Speech Emotion Recognition: Urdu vs. Western Languages

siddiquelatif/URDU-Dataset 15 Dec 2018

Cross-lingual speech emotion recognition is an important task for practical applications.