no code implementations • 12 Dec 2023 • Shengqiang Li, Chao Lei, Baozhong Ma, BinBin Zhang, Fuping Pan
This study describes our system for the Task 1 Single-speaker Visual Speech Recognition (VSR) fixed track of the Chinese Continuous Visual Speech Recognition Challenge (CNVSRC) 2023.
no code implementations • 13 Jul 2021 • Shengqiang Li, Menglong Xu, Xiao-Lei Zhang
To exploit the temporal order of the input sequence, many works inject information about the relative or absolute position of each element into the input sequence.
no code implementations • 13 Jul 2021 • Menglong Xu, Shengqiang Li, Chengdong Liang, Xiao-Lei Zhang
Deep neural networks provide effective solutions to small-footprint keyword spotting (KWS).
1 code implementation • 28 Mar 2021 • Shanzheng Guan, Shupei Liu, Junqi Chen, Wenbo Zhu, Shengqiang Li, Xu Tan, Ziye Yang, Menglong Xu, Yijiang Chen, Jianyu Wang, Xiao-Lei Zhang
We trained several multi-device speech recognition systems on both the Libri-adhoc40 dataset and a simulated dataset.
1 code implementation • 23 Oct 2020 • Menglong Xu, Shengqiang Li, Xiao-Lei Zhang
To reduce computational complexity and improve performance, we further propose local DSA (LDSA), which restricts the attention scope of DSA to a local range around the current central frame for speech recognition.