no code implementations • 20 Nov 2023 • Jungil Kong, Junmo Lee, Jeongmin Kim, Beomjeong Kim, Jihoon Park, Dohee Kong, Changheon Lee, Sangjin Kim
To overcome previous limitations, we propose effective methods for feature learning and representing target speakers' speech characteristics by discretizing the features and conditioning them to a speech synthesis model.
2 code implementations • 31 Jul 2023 • Jungil Kong, Jihoon Park, Beomjeong Kim, Jeongmin Kim, Dohee Kong, Sangjin Kim
Single-stage text-to-speech models have been actively studied recently, and their results have outperformed two-stage pipeline systems.
10 code implementations • NeurIPS 2020) 2020 • Jungil Kong, Jaehyeon Kim, Jaekyoung Bae
Several recent work on speech synthesis have employed generative adversarial networks (GANs) to produce raw waveforms.
Ranked #10 on Speech Synthesis on LibriTTS
5 code implementations • NeurIPS 2020 • Jaehyeon Kim, Sungwon Kim, Jungil Kong, Sungroh Yoon
By leveraging the properties of flows, MAS searches for the most probable monotonic alignment between text and the latent representation of speech.
Ranked #4 on Text-To-Speech Synthesis on LJSpeech (using extra training data)