Search Results for author: Jinlong Xue

Found 7 papers, 3 papers with code

Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation

1 code implementation • 2 Jan 2024 • Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li

Drawing inspiration from state-of-the-art Text-to-Image (T2I) diffusion models, we introduce Auffusion, a text-to-audio (TTA) system that adapts T2I model frameworks to the TTA task by effectively leveraging their inherent generative strengths and precise cross-modal alignment.

Audio Generation, Style Transfer

Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis

no code implementations • 5 Jun 2023 • Dengfeng Ke, Yayue Deng, Yukang Jia, Jinlong Xue, Qi Luo, Ya Li, Jianqing Sun, Jiaen Liang, Binghuai Lin

Regressive Text-to-Speech (TTS) systems use an attention mechanism to generate the alignment between the text and the acoustic feature sequence.

Sentence, Speech Synthesis

M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis

no code implementations • 3 May 2023 • Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, JianHua Tao, Jianqing Sun, Jiaen Liang

However, it is still a challenge to comprehensively model the conversation, and a majority of conversational TTS systems only focus on extracting global information and omit local prosody features, which contain important fine-grained information like keywords and emphasis.

Speech Synthesis, Text-To-Speech Synthesis

A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis

no code implementations • 7 Oct 2022 • Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang

Then we used keypoint decomposition to extract video synthesis controlling parameters from the backend output and the source image.

ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis

1 code implementation • 20 Mar 2022 • Jinlong Xue, Yayue Deng, Yichen Han, Ya Li, Jianqing Sun, Jiaen Liang

In recent years, neural network based methods for multi-speaker text-to-speech synthesis (TTS) have made significant progress.

Speaker Verification, Speech Synthesis, +1
