Search Results for author: Dengfeng Ke

Found 12 papers, 5 papers with code

Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis

no code implementations5 Jun 2023 Dengfeng Ke, Yayue Deng, Yukang Jia, Jinlong Xue, Qi Luo, Ya Li, Jianqing Sun, Jiaen Liang, Binghuai Lin

Regressive Text-to-Speech (TTS) system utilizes attention mechanism to generate alignment between text and acoustic feature sequence.

Sentence Speech Synthesis

Text-Aware End-to-end Mispronunciation Detection and Diagnosis

1 code implementation15 Jun 2022 Linkai Peng, Yingming Gao, Binghuai Lin, Dengfeng Ke, Yanlu Xie, Jinsong Zhang

In the field of assessing the pronunciation quality of constrained speech, the given transcriptions can play the role of a teacher.

Contrastive Learning

An Empirical Study on End-to-End Singing Voice Synthesis with Encoder-Decoder Architectures

no code implementations6 Aug 2021 Dengfeng Ke, Yuxing Lu, Xudong Liu, Yanyan Xu, Jing Sun, Cheng-Hao Cai

With the rapid development of neural network architectures and speech processing models, singing voice synthesis with neural networks is becoming the cutting-edge technique of digital music production.

Singing Voice Synthesis

Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU

no code implementations6 May 2021 Dengfeng Ke, Jinsong Zhang, Yanlu Xie, Yanyan Xu, Binghuai Lin

With all these modifications, the size of the PHASEN model is shrunk from 33M parameters to 5M parameters, while the performance on VoiceBank+DEMAND is improved to the CSIG score of 4. 30, the PESQ score of 3. 07 and the COVL score of 3. 73.

Speech Enhancement

A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augmentation Techniques

1 code implementation17 Apr 2021 Kaiqi Fu, Jones Lin, Dengfeng Ke, Yanlu Xie, Jinsong Zhang, Binghuai Lin

Recently, end-to-end mispronunciation detection and diagnosis (MD&D) systems has become a popular alternative to greatly simplify the model-building process of conventional hybrid DNN-HMM systems by representing complicated modules with a single deep network architecture.

Data Augmentation

Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack Detection

1 code implementation25 Jun 2020 Yongqiang Dou, Haocheng Yang, Maolin Yang, Yanyan Xu, Dengfeng Ke

Besides, in the experiments, we select three kinds of features that contain both magnitude-based and phase-based information to form complementary and informative features.

Binary Classification Speaker Verification

Complementary Fusion of Multi-Features and Multi-Modalities in Sentiment Analysis

3 code implementations17 Apr 2019 Feiyang Chen, Ziqian Luo, Yanyan Xu, Dengfeng Ke

Therefore, in this paper, based on audio and text, we consider the task of multimodal sentiment analysis and propose a novel fusion strategy including both multi-feature fusion and multi-modality fusion to improve the accuracy of audio-text sentiment analysis.

Multimodal Emotion Recognition Multimodal Sentiment Analysis

Boosting Noise Robustness of Acoustic Model via Deep Adversarial Training

no code implementations2 May 2018 Bin Liu, Shuai Nie, Yaping Zhang, Dengfeng Ke, Shan Liang, Wenju Liu1

In realistic environments, speech is usually interfered by various noise and reverberation, which dramatically degrades the performance of automatic speech recognition (ASR) systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Trainable back-propagated functional transfer matrices

1 code implementation28 Oct 2017 Cheng-Hao Cai, Yanyan Xu, Dengfeng Ke, Kaile Su, Jing Sun

In experiments, it is demonstrated that the revised rules can be used to train a range of functional connections: 20 different functions are applied to neural networks with up to 10 hidden layers, and most of them gain high test accuracies on the MNIST database.

Learning of Human-like Algebraic Reasoning Using Deep Feedforward Neural Networks

no code implementations25 Apr 2017 Cheng-Hao Cai, Dengfeng Ke, Yanyan Xu, Kaile Su

Briefly, in a reasoning system, a deep feedforward neural network is used to guide rewriting processes after learning from algebraic reasoning examples produced by humans.

Cannot find the paper you are looking for? You can Submit a new open access paper.