no code implementations • 13 Mar 2024 • ZiQi Liang, Haoxiang Shi, Jiawei Wang, Keda Lu
Recurrent neural networks have become a standard modeling technique for sequential data in TTS systems and are widely used.
no code implementations • 15 Nov 2023 • Jianzong Wang, Yimin Deng, ZiQi Liang, xulong Zhang, Ning Cheng, Jing Xiao
This paper proposes a talking face generation method named "CP-EB" that takes an audio signal as input and a person image as reference, to synthesize a photo-realistic people talking video with head poses controlled by a short video clip and proper eye blinking embedding.
no code implementations • 24 Oct 2022 • ZiQi Liang
The final experimental results show that the TTS model using only the CNN component can reduce the training time compared to the classic TTS models such as Tacotron while ensuring the quality of the synthesized speech.