1 code implementation • 21 Mar 2024 • Ali Ezati, Mohammadreza Dezyani, Rajib Rana, Roozbeh Rajabi, Ahmad Ayatollahi
On the other hand, the PWFS block employs a feature selection mechanism that discards less meaningful features prior to the fusion process.
Facial Expression Recognition Facial Expression Recognition (FER) +1
no code implementations • 21 Mar 2024 • Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Bjorn W. Schuller, Carlos Busso
This study presents emoDARTS, a DARTS-optimised joint CNN and Sequential Neural Network (SeqNN: LSTM, RNN) architecture that enhances SER performance.
Ranked #1 on Speech Emotion Recognition on MSP-IMPROV
no code implementations • 7 Oct 2023 • Chung-Soo Ahn, Jagath C. Rajapakse, Rajib Rana
While speech emotion recognition (SER) research has made significant progress, achieving generalization across various corpora continues to pose a problem.
no code implementations • 22 Jun 2023 • Elias Hossain, Rajib Rana, Niall Higgins, Jeffrey Soar, Prabal Datta Barua, Anthony R. Pisani, Ph. D, Kathryn Turner}
Various Machine Learning (ML), Deep Learning (DL) and NLP techniques are studied and compared to understand the limitations and opportunities in this space comprehensively.
no code implementations • 23 May 2023 • Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Björn Schuller
In contrast to previous studies, we refrain from imposing constraints on the order of the layers for the CNN within the DARTS cell; instead, we allow DARTS to determine the optimal layer order autonomously.
Ranked #5 on Speech Emotion Recognition on IEMOCAP (UA metric)
1 code implementation • 6 Oct 2022 • Shamane Siriwardhana, Rivindu Weerasekera, Elliott Wen, Tharindu Kaluarachchi, Rajib Rana, Suranga Nanayakkara
We propose \textit{RAG-end2end}, an extension to RAG, that can adapt to a domain-specific knowledge base by updating all components of the external knowledge base during training.
no code implementations • 11 Aug 2022 • Kun Zhou, Berrak Sisman, Rajib Rana, B. W. Schuller, Haizhou Li
We then incorporate our formulation into a sequence-to-sequence emotional text-to-speech framework.
no code implementations • 7 Jul 2022 • Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Bjorn W. Schuller
Evaluation results show that in a live data feed setting, RL-DA outperforms a baseline strategy by 11% and 14% in cross-corpus and cross-language scenarios, respectively.
no code implementations • 10 Jan 2022 • Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li
As desired, the proposed network controls the fine-grained emotion intensity in the output speech.
1 code implementation • 4 Jan 2021 • Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Björn W. Schuller, Jiajun Liu
In addition, extended learning period is a general challenge for deep RL which can impact the speed of learning for SER.
no code implementations • 1 Jun 2020 • Kazi Nazmul Haque, Rajib Rana, Björn W. Schuller
Hence, with the extensive experimental results, we have demonstrated that by harnessing the power of the high-fidelity audio generation, the proposed GAAE model can learn powerful representation from unlabelled dataset leveraging a fewer percentage of labelled data as supervision/guidance.
no code implementations • 5 Mar 2020 • Kazi Nazmul Haque, Rajib Rana, John H. L. Hansen, Björn Schuller
However, the model can become redundant if it is intended for a specific task.
no code implementations • 2 Jan 2020 • Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Junaid Qadir, Björn W. Schuller
Research on speech processing has traditionally considered the task of designing hand-engineered acoustic features (feature engineering) as a separate distinct problem from the task of designing efficient machine learning (ML) models to make prediction and classification decisions.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • 24 Oct 2019 • Thejan Rajapakshe, Rajib Rana, Siddique Latif, Sara Khalifa, Björn W. Schuller
Deep reinforcement learning (deep RL) is a combination of deep learning with reinforcement learning principles to create efficient methods that can learn by interacting with its environment.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 18 Apr 2019 • Kazi Nazmul Haque, Siddique Latif, Rajib Rana
Learning disentangled representation from any unlabelled data is a non-trivial problem.
no code implementations • 28 Nov 2018 • Siddique Latif, Rajib Rana, Junaid Qadir
Deep learning has undoubtedly offered tremendous improvements in the performance of state-of-the-art speech emotion recognition (SER) systems.
no code implementations • 24 Nov 2018 • Siddique Latif, Muhammad Asim, Muhammad Usman, Junaid Qadir, Rajib Rana
Multishot Magnetic Resonance Imaging (MRI) has recently gained popularity as it accelerates the MRI data acquisition process without compromising the quality of final MR image.
no code implementations • 25 Jan 2018 • Siddique Latif, Muhammad Usman, Rajib Rana, Junaid Qadir
Our choice of RNNs is motivated by the great success of deep learning in medical applications and by the observation that RNNs represent the deep learning configuration most suitable for dealing with sequential or temporal data even in the presence of noise.
1 code implementation • 19 Jan 2018 • Siddique Latif, Rajib Rana, Shahzad Younis, Junaid Qadir, Julien Epps
The majority of existing speech emotion recognition research focuses on automatic emotion detection using training and testing data from same corpus collected under the same conditions.
no code implementations • 16 Jan 2018 • Kazi Nazmul Haque, Mohammad Abu Yousuf, Rajib Rana
In the proposed model, the encoder reads an image and catches the abstraction of that image in a vector, where decoder takes that vector as well as the corrupted image to reconstruct a clean image.
no code implementations • 23 Dec 2017 • Siddique Latif, Rajib Rana, Junaid Qadir, Julien Epps
Inspired by this, we propose VAEs for deriving the latent representation of speech signals and use this representation to classify emotions.
no code implementations • 19 Jan 2015 • Sajib Saha, Frank de Hoog, Ya. I. Nesterets, Rajib Rana, M. Tahtali, T. E. Gureyev
Methods: To better specify the sparsity profile and to ensure efficient source localization, the proposed approach considers grouping of the electrical current dipoles inside human brain.
no code implementations • 5 Sep 2014 • Rajib Rana, Brano Kusy, Josh Wall, Wen Hu
Much of the energy consumption in buildings is due to HVAC systems, which has motivated several recent studies on making these systems more energy- efficient.