Search Results for author: Heayoung Park

Found 2 papers, 0 papers with code

FastFit: Towards Real-Time Iterative Neural Vocoder by Replacing U-Net Encoder With Multiple STFTs

no code implementations • 18 May 2023 • Won Jang, Dan Lim, Heayoung Park

This paper presents FastFit, a novel neural vocoder architecture that replaces the U-Net encoder with multiple short-time Fourier transforms (STFTs) to achieve faster generation rates without sacrificing sample quality.

JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment

no code implementations • 15 May 2020 • Dan Lim, Won Jang, Gyeonghwan O, Heayoung Park, Bong-Wan Kim, Jaesam Yoon

We propose Jointly trained Duration Informed Transformer (JDI-T), a feed-forward Transformer with a duration predictor jointly trained without explicit alignments in order to generate an acoustic feature sequence from an input text.