Singing Voice Synthesis
19 papers with code • 0 benchmarks • 0 datasets
Most implemented papers
HiFi-WaveGAN: Generative Adversarial Network with Auxiliary Spectrogram-Phase Loss for High-Fidelity Singing Voice Generation
Entertainment-oriented singing voice synthesis (SVS) requires a vocoder to generate high-fidelity (e.g., 48 kHz) audio.
Xiaoicesing 2: A High-Fidelity Singing Voice Synthesizer Based on Generative Adversarial Network
XiaoiceSing is a singing voice synthesis (SVS) system that aims at generating 48 kHz singing voices.
M4Singer: a Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus
The lack of publicly available high-quality and accurately labeled datasets has long been a major bottleneck for singing voice synthesis (SVS).
Cross-domain Neural Pitch and Periodicity Estimation
Pitch is a foundational aspect of our perception of audio signals.
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
In this paper, we propose a "Co"nsistency "Mo"del-based "Speech" synthesis method, CoMoSpeech, which performs speech synthesis in a single diffusion sampling step while achieving high audio quality.
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming
We show the equivalence of the Gibbs distribution to a message-passing algorithm by the properties of the Gumbel distribution and give all the ingredients required for variational Bayesian inference of a latent path, namely Bayesian dynamic programming (BDP).
FSD: An Initial Chinese Dataset for Fake Song Detection
In this paper, we construct an initial Chinese Fake Song Detection (FSD) dataset to investigate the field of song deepfake detection.
SingFake: Singing Voice Deepfake Detection
These unique properties make singing voice deepfake detection a relevant but significantly different problem from synthetic speech detection.
BiSinger: Bilingual Singing Voice Synthesis
We fuse monolingual singing datasets with open-source singing voice conversion techniques to generate bilingual singing voices while also exploring the potential use of bilingual speech data.