Singing Voice Synthesis

19 papers with code • 0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

HiFi-WaveGAN: Generative Adversarial Network with Auxiliary Spectrogram-Phase Loss for High-Fidelity Singing Voice Generation

zengchang233/xiaoicesing2 23 Oct 2022

Entertainment-oriented singing voice synthesis (SVS) requires a vocoder to generate high-fidelity (e. g. 48kHz) audio.

Xiaoicesing 2: A High-Fidelity Singing Voice Synthesizer Based on Generative Adversarial Network

zengchang233/xiaoicesing2 Interspeech 2023

XiaoiceSing is a singing voice synthesis (SVS) system that aims at generating 48kHz singing voices.

M4Singer: a Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus

m4singer/m4singer NIPS 2022

The lack of publicly available high-quality and accurately labeled datasets has long been a major bottleneck for singing voice synthesis (SVS).

Cross-domain Neural Pitch and Periodicity Estimation

interactiveaudiolab/penn 28 Jan 2023

Pitch is a foundational aspect of our perception of audio signals.

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

zhenye234/CoMoSpeech 11 May 2023

In this paper, we propose a "Co"nsistency "Mo"del-based "Speech" synthesis method, CoMoSpeech, which achieve speech synthesis through a single diffusion sampling step while achieving high audio quality.

Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming

Berthaniu/LatentOptimalPathsBayesianDP 5 Jun 2023

We show the equivalence of the Gibbs distribution to a message-passing algorithm by the properties of the Gumbel distribution and give all the ingredients required for variational Bayesian inference of a latent path, namely Bayesian dynamic programming (BDP).

FSD: An Initial Chinese Dataset for Fake Song Detection

xieyuankun/fsd-dataset 5 Sep 2023

In this paper, we initially construct a Chinese Fake Song Detection (FSD) dataset to investigate the field of song deepfake detection.

SingFake: Singing Voice Deepfake Detection

yongyizang/SingFake 14 Sep 2023

These unique properties make singing voice deepfake detection a relevant but significantly different problem from synthetic speech detection.

BiSinger: Bilingual Singing Voice Synthesis

BiSinger-SVS/BiSinger 25 Sep 2023

We fuse monolingual singing datasets with open-source singing voice conversion techniques to generate bilingual singing voices while also exploring the potential use of bilingual speech data.