1 code implementation • 30 Oct 2022 • Jie Wang, Menglong Xu, Jingyong Hou, BinBin Zhang, Xiao-Lei Zhang, Lei Xie, Fuping Pan
Keyword spotting (KWS) enables speech-based user interaction and gradually becomes an indispensable component of smart devices.
no code implementations • 13 Jul 2021 • Shengqiang Li, Menglong Xu, Xiao-Lei Zhang
To make use of the time order of the input sequence, many works inject some information about the relative or absolute position of the element into the input sequence.
no code implementations • 13 Jul 2021 • Menglong Xu, Shengqiang Li, Chengdong Liang, Xiao-Lei Zhang
Deep neural networks provide effective solutions to small-footprint keyword spotting (KWS).
no code implementations • 29 Mar 2021 • Chengdong Liang, Menglong Xu, Xiao-Lei Zhang
Although the performance of the proposed resGSA-Transformer is only slightly better than that of the RPSA-Transformer, it does not have to tune the window length manually.
1 code implementation • 28 Mar 2021 • Shanzheng Guan, Shupei Liu, Junqi Chen, Wenbo Zhu, Shengqiang Li, Xu Tan, Ziye Yang, Menglong Xu, Yijiang Chen, Jianyu Wang, Xiao-Lei Zhang
We trained several multi-device speech recognition systems on both the Libri-adhoc40 dataset and a simulated dataset.
1 code implementation • 23 Oct 2020 • Menglong Xu, Shengqiang Li, Xiao-Lei Zhang
To reduce the computational complexity and improve the performance, we further propose local DSA (LDSA) to restrict the attention scope of DSA to a local range around the current central frame for speech recognition.