no code implementations • 16 Mar 2022 • Mashrur M. Morshed, Ahmad Omar Ahsan, Hasan Mahmud, Md. Kamrul Hasan
In this paper, we propose an efficient MLP-based approach for learning audio representations, namely timestamp-level and scene-level audio embeddings.
1 code implementation • 14 Oct 2021 • Mashrur M. Morshed, Ahmad Omar Ahsan
Until now, attention-based models have been used with great success in keyword spotting.
Ranked #8 on Keyword Spotting on Google Speech Commands (Google Speech Commands V2 35 metric)
1 code implementation • 6 Jul 2021 • Hasan Mahmud, Mashrur M. Morshed, Md. Kamrul Hasan
In this paper, we revisit this approach to hand gesture recognition and suggest several improvements.