1 code implementation • 28 May 2023 • Sewade Ogun, Vincent Colotte, Emmanuel Vincent
Flow-based generative models are widely used in text-to-speech (TTS) systems to learn the distribution of audio features (e. g., Mel-spectrograms) given the input tokens and to sample from this distribution to generate diverse utterances.
1 code implementation • 12 Oct 2022 • Sewade Ogun, Vincent Colotte, Emmanuel Vincent
We show the viability of this approach for training a multi-speaker GlowTTS model on the Common Voice English dataset.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1