Search Results for author: Jungil Kong

Found 4 papers, 3 papers with code

Encoding Speaker-Specific Latent Speech Feature for Speech Synthesis

no code implementations20 Nov 2023 Jungil Kong, Junmo Lee, Jeongmin Kim, Beomjeong Kim, Jihoon Park, Dohee Kong, Changheon Lee, Sangjin Kim

To overcome previous limitations, we propose effective methods for feature learning and representing target speakers' speech characteristics by discretizing the features and conditioning them to a speech synthesis model.

Speech Synthesis

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

2 code implementations31 Jul 2023 Jungil Kong, Jihoon Park, Beomjeong Kim, Jeongmin Kim, Dohee Kong, Sangjin Kim

Single-stage text-to-speech models have been actively studied recently, and their results have outperformed two-stage pipeline systems.

Computational Efficiency

Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search

5 code implementations NeurIPS 2020 Jaehyeon Kim, Sungwon Kim, Jungil Kong, Sungroh Yoon

By leveraging the properties of flows, MAS searches for the most probable monotonic alignment between text and the latent representation of speech.

Ranked #4 on Text-To-Speech Synthesis on LJSpeech (using extra training data)

Text-To-Speech Synthesis

Cannot find the paper you are looking for? You can Submit a new open access paper.