1 code implementation • 12 Feb 2024 • Puneet Kumar, Sarthak Malik, Balasubramanian Raman, Xiaobai Li
It implements an interpretability technique to analyze the contribution of textual and visual features during the generation of uncontrolled and controlled feedback.
1 code implementation • 25 Aug 2022 • Puneet Kumar, Sarthak Malik, Balasubramanian Raman
A new interpretability technique has been developed to identify the important speech & image features leading to the prediction of particular emotion classes.
1 code implementation • 24 Aug 2022 • Puneet Kumar, Sarthak Malik, Balasubramanian Raman, Xiaobai Li
This paper proposes a multimodal emotion recognition system, VIsual Spoken Textual Additive Net (VISTA Net), to classify emotions reflected by multimodal input containing image, speech, and text into discrete classes.