no code implementations • 1 Aug 2023 • Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar
This model takes an intuitive approach to combining audio-image and video modalities, with the primary aim of improving the effectiveness of multimodal human action recognition (MHAR).
no code implementations • IEEE International Conference on Visual Communications and Image Processing (VCIP) 2023 • Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar
Currently, action recognition is predominantly performed on video data processed by CNNs.