Search Results for author: Askat Kuzdeuov

Found 1 paper, 1 paper with code

SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams

1 code implementation • 5 Dec 2020 • Madina Abdrakhmanova, Askat Kuzdeuov, Sheikh Jarju, Yerbolat Khassanov, Michael Lewis, Huseyin Atakan Varol

We present SpeakingFaces, a publicly available large-scale multimodal dataset developed to support machine learning research in contexts that combine thermal, visual, and audio data streams; examples include human-computer interaction, biometric authentication, recognition systems, domain transfer, and speech recognition.

Speech Recognition +1

