Search Results for author: Zewang Zhang

Found 5 papers, 1 paper with code

WeSinger: Data-augmented Singing Voice Synthesis with Auxiliary Losses

no code implementations • 21 Mar 2022 • Zewang Zhang, Yibin Zheng, Xinhui Li, Li Lu

To improve the accuracy and naturalness of synthesized singing voice, we design several specific modules and techniques: 1) a deep bi-directional LSTM-based duration model with a multi-scale rhythm loss and a post-processing step; 2) a Transformer-like acoustic model with a progressive pitch-weighted decoder loss; 3) a 24 kHz pitch-aware LPCNet neural vocoder to produce high-quality singing waveforms; 4) a novel data augmentation method with multi-singer pre-training for stronger robustness and naturalness.

Data Augmentation, Decoder, +1

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

1 code implementation • 24 Nov 2020 • Qiao Tian, Yi Chen, Zewang Zhang, Heng Lu, LingHui Chen, Lei Xie, Shan Liu

On the one hand, we propose to discriminate the ground-truth waveform from the synthetic one in the frequency domain, rather than only in the time domain, to offer stronger consistency guarantees.

Generative Adversarial Network, Speech Synthesis

AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN

no code implementations • 12 May 2020 • Zewang Zhang, Qiao Tian, Heng Lu, Ling-Hui Chen, Shan Liu

This paper investigates how to leverage a DurIAN-based average model to enable a new speaker to have both accurate pronunciation and fluent cross-lingual speaking with very limited monolingual data.

Few-Shot Learning

Composing Music with Grammar Argumented Neural Networks and Note-Level Encoding

no code implementations • 16 Nov 2016 • Zheng Sun, Jiaqi Liu, Zewang Zhang, Jingwen Chen, Zhao Huo, Ching Hua Lee, Xiao Zhang

Creating aesthetically pleasing pieces of art, including music, has been a long-term goal for artificial intelligence research.

Music Generation
