Search Results for author: Zhen Ye

Found 7 papers, 3 papers with code

FlashSpeech: Efficient Zero-Shot Speech Synthesis

1 code implementation • 23 Apr 2024 • Zhen Ye, Zeqian Ju, Haohe Liu, Xu Tan, Jianyi Chen, Yiwen Lu, Peiwen Sun, Jiahao Pan, Weizhen Bian, Shulin He, Qifeng Liu, Yike Guo, Wei Xue

The generation processes of FlashSpeech can be achieved efficiently with one or two sampling steps while maintaining high audio quality and high similarity to the audio prompt for zero-shot speech generation.

Speech Synthesis • Voice Conversion

142

CoMoSVC: Consistency Model-based Singing Voice Conversion

no code implementations • 3 Jan 2024 • Yiwen Lu, Zhen Ye, Wei Xue, Xu Tan, Qifeng Liu, Yike Guo

Diffusion-based Singing Voice Conversion (SVC) methods have achieved remarkable performance, producing natural audio with high similarity to the target timbre.

Voice Conversion

NAS-FM: Neural Architecture Search for Tunable and Interpretable Sound Synthesis based on Frequency Modulation

no code implementations • 22 May 2023 • Zhen Ye, Wei Xue, Xu Tan, Qifeng Liu, Yike Guo

Since expert knowledge is hard to acquire, it hinders the flexibility to quickly design and tune digital synthesizers for diverse sounds.

Neural Architecture Search

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

1 code implementation • 11 May 2023 • Zhen Ye, Wei Xue, Xu Tan, Jie Chen, Qifeng Liu, Yike Guo

In this paper, we propose a "Co"nsistency "Mo"del-based "Speech" synthesis method, CoMoSpeech, which achieves speech synthesis through a single diffusion sampling step while maintaining high audio quality.

Denoising • Singing Voice Synthesis • +1

142

Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features

no code implementations • 5 May 2021 • Rong Huang, Wei Yao, Yusheng Xu, Zhen Ye, Uwe Stilla

Registration is a fundamental but critical task in point cloud processing, which usually depends on finding element correspondence from two point clouds.

Graph Matching • Point Cloud Registration • +1

Financial risk prediction with multi-round QA attention network

no code implementations • IJCAI 2021 • Zhen Ye, Yu Qin, Wei Xu

BLVD: Building A Large-scale 5D Semantics Benchmark for Autonomous Driving

1 code implementation • 15 Mar 2019 • Jianru Xue, Jianwu Fang, Tao Li, Bohua Zhang, Pu Zhang, Zhen Ye, Jian Dou

Instead, BLVD aims to provide a platform for the tasks of dynamic 4D (3D+temporal) tracking, 5D (4D+interactive) interactive event recognition and intention prediction.

Autonomous Driving • Instance Segmentation • +5

165
