no code implementations • 13 Jun 2023 • Ji-Sang Hwang, Sang-Hoon Lee, Seong-Whan Lee
Furthermore, we introduce a pause-based word encoder to model word-level prosody based on pause sequence.
no code implementations • 12 Jun 2023 • Ji-Sang Hwang, Sang-Hoon Lee, Seong-Whan Lee
To alleviate the challenges posed by model complexity in singing voice synthesis, we propose HiddenSinger, a high-quality singing voice synthesis system using a neural audio codec and latent diffusion models.