TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Speech Emotion Recognition	IEMOCAP	SER with MTL	F1	-	# 2
Speech Emotion Recognition	IEMOCAP	SER with MTL	WA CV	0.789	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/speech-emotion-recognition-with-multi-task/speech-emotion-recognition-on-iemocap)](https://paperswithcode.com/sota/speech-emotion-recognition-on-iemocap?p=speech-emotion-recognition-with-multi-task)`

Speech Emotion Recognition with Multi-Task Learning

Interspeech 2021 · Cai, Xingyu Yuan, Jiahong Zheng, Renjie Huang, Liang Church, Kenneth ·

Speech emotion recognition (SER) classifies speech into emotion categories such as: Happy, Angry, Sad and Neutral. Recently , deep learning has been applied to the SER task. This paper proposes a multi-task learning (MTL) framework to simultaneously perform speech-to-text recognition and emotion classification, with an end-to-end deep neural model based on wav2vec-2.0. Experiments on the IEMOCAP benchmark show that the proposed method achieves the state-of-the-art performance on the SER task. In addition, an ablation study establishes the effectiveness of the proposed MTL framework.

PDF