no code implementations • 1 Dec 2020 • Weicheng Cai, Ming Li
This paper proposes a unified deep speaker embedding framework for modeling speech data with different sampling rates.
no code implementations • 23 Feb 2020 • Qingjian Lin, Weicheng Cai, Lin Yang, Jun-Jie Wang, Jun Zhang, Ming Li
Our diarization system includes multiple modules, namely voice activity detection (VAD), segmentation, speaker embedding extraction, similarity scoring, clustering, resegmentation and overlap detection.
no code implementations • 5 Jul 2019 • Weicheng Cai, Haiwei Wu, Danwei Cai, Ming Li
This paper describes our DKU replay detection system for the ASVspoof 2019 challenge.
no code implementations • 20 Feb 2019 • Weicheng Cai, Danwei Cai, Shen Huang, Ming Li
In this paper, we present an end-to-end language identification framework, the attention-based Convolutional Neural Network-Bidirectional Long Short-Term Memory (CNN-BLSTM).
1 code implementation • 9 Sep 2018 • Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li
In this paper, we apply the NetFV and NetVLAD layers for the end-to-end language identification task.
1 code implementation • 14 Apr 2018 • Weicheng Cai, Jinkun Chen, Ming Li
In an end-to-end system, the encoding layer aggregates the variable-length input sequence into an utterance-level representation.
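As a rough illustration of what such an encoding layer does, the sketch below uses temporal average pooling, the simplest possible choice and not necessarily the layer studied in the paper. The function name `average_pool` and the toy inputs are assumptions made for this example only.

```python
from typing import List


def average_pool(frames: List[List[float]]) -> List[float]:
    """Collapse a (T x D) sequence of frame-level features into one D-dim vector.

    Hypothetical helper for illustration: it averages over the time axis, so
    utterances of any length T map to a vector of the same size D.
    """
    if not frames:
        raise ValueError("need at least one frame")
    dim = len(frames[0])
    if any(len(frame) != dim for frame in frames):
        raise ValueError("all frames must have the same dimension")
    num_frames = len(frames)
    # Mean of each feature dimension across all frames.
    return [sum(frame[d] for frame in frames) / num_frames for d in range(dim)]


# Two toy "utterances" of different lengths (2 and 4 frames), each with D = 2.
short_utt = [[1.0, 2.0], [3.0, 4.0]]
long_utt = [[0.0, 0.0], [2.0, 2.0], [4.0, 4.0], [6.0, 6.0]]

short_vec = average_pool(short_utt)
long_vec = average_pool(long_utt)
```

Both outputs have length 2 regardless of the number of input frames, which is the property that lets a fixed-size classifier follow the encoding layer.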
no code implementations • 2 Apr 2018 • Weicheng Cai, Zexin Cai, Xiang Zhang, Xiaoqi Wang, Ming Li
A novel learnable dictionary encoding layer is proposed in this paper for end-to-end language identification.
no code implementations • 2 Apr 2018 • Weicheng Cai, Zexin Cai, Wenbo Liu, Xiaoqi Wang, Ming Li
After comparing with the state-of-the-art GMM i-vector methods, we give insights into the CNN and reveal its role and effect in the whole pipeline.
no code implementations • 24 Jul 2015 • Shitao Weng, Shushan Chen, Lei Yu, Xuewei Wu, Weicheng Cai, Zhi Liu, Ming Li
As a countermeasure to detect these spoofed speech signals, we propose a score-level fusion approach that combines several different i-vector subsystems.
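Score-level fusion in general can be sketched as a weighted sum of the scores that each subsystem assigns to the same trial. The snippet below is only a minimal sketch under that assumption: the function name `fuse_scores`, the weights, and the scores are all hypothetical, and the paper's actual fusion rule may differ. In practice the weights are usually tuned on development data rather than fixed by hand.

```python
from typing import Sequence


def fuse_scores(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Combine per-subsystem scores for one trial into a single fused score.

    Hypothetical helper for illustration: a plain weighted sum.
    """
    if len(scores) != len(weights):
        raise ValueError("need exactly one weight per subsystem score")
    return sum(w * s for w, s in zip(weights, scores))


# Made-up scores from three subsystems for one trial, with made-up weights.
subsystem_scores = [1.0, 2.0, -1.0]
fusion_weights = [0.5, 0.25, 0.25]

fused = fuse_scores(subsystem_scores, fusion_weights)
```

The fused score would then be compared against a decision threshold to accept the trial as genuine or reject it as spoofed.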