In this paper, we propose a "Co"nsistency "Mo"del-based "Speech" synthesis method, CoMoSpeech, which achieve speech synthesis through a single diffusion sampling step while achieving high audio quality.

Paper
Code

Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming

Berthaniu/LatentOptimalPathsBayesianDP • • 5 Jun 2023

We show the equivalence of the Gibbs distribution to a message-passing algorithm by the properties of the Gumbel distribution and give all the ingredients required for variational Bayesian inference of a latent path, namely Bayesian dynamic programming (BDP).

Paper
Code

FSD: An Initial Chinese Dataset for Fake Song Detection

xieyuankun/fsd-dataset • 5 Sep 2023

In this paper, we initially construct a Chinese Fake Song Detection (FSD) dataset to investigate the field of song deepfake detection.

Paper
Code

SingFake: Singing Voice Deepfake Detection

yongyizang/SingFake • • 14 Sep 2023

These unique properties make singing voice deepfake detection a relevant but significantly different problem from synthetic speech detection.

Paper
Code

BiSinger: Bilingual Singing Voice Synthesis

BiSinger-SVS/BiSinger • • 25 Sep 2023

We fuse monolingual singing datasets with open-source singing voice conversion techniques to generate bilingual singing voices while also exploring the potential use of bilingual speech data.

Paper
Code

Singing Voice Synthesis

Benchmarks Add a Result

Most implemented papers

Content

Benchmarks

Add a Result