no code implementations • 19 Apr 2024 • Chengxin Chen, Pengyuan Zhang
One persistent challenge in Speech Emotion Recognition (SER) is ubiquitous environmental noise, which frequently degrades SER performance in practical use.
1 code implementation • 26 Dec 2023 • Chengxin Chen, Pengyuan Zhang
As a vital aspect of affective computing, Multimodal Emotion Recognition has been an active research area in the multimedia community.
no code implementations • 25 Dec 2023 • Chengxin Chen, Pengyuan Zhang
One persistent challenge in deep-learning-based speech emotion recognition (SER) is the unconscious encoding of emotion-irrelevant factors (e.g., speaker or phonetic variability), which limits the generalization of SER in practical use.
no code implementations • 25 Apr 2022 • Chengxin Chen, Meng Wang, Pengyuan Zhang
Recently, audio-visual scene classification (AVSC) has attracted increasing attention from multidisciplinary communities.
no code implementations • 31 Mar 2022 • Chengxin Chen, Pengyuan Zhang
To further exploit the embeddings from different layers of the ASR encoder, we propose a novel CTA-RNN architecture to capture the emotionally salient parts of embeddings in both the channel and temporal directions.