Search Results for author: Vincent Colotte

Found 5 papers, 2 papers with code

Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS

1 code implementation28 May 2023 Sewade Ogun, Vincent Colotte, Emmanuel Vincent

Flow-based generative models are widely used in text-to-speech (TTS) systems to learn the distribution of audio features (e. g., Mel-spectrograms) given the input tokens and to sample from this distribution to generate diverse utterances.

Zero-Shot Multi-Speaker TTS

Can we use Common Voice to train a Multi-Speaker TTS system?

1 code implementation12 Oct 2022 Sewade Ogun, Vincent Colotte, Emmanuel Vincent

We show the viability of this approach for training a multi-speaker GlowTTS model on the Common Voice English dataset.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

The IFCASL Corpus of French and German Non-native and Native Read Speech

no code implementations LREC 2016 Juergen Trouvain, Anne Bonneau, Vincent Colotte, Camille Fauth, Dominique Fohr, Denis Jouvet, Jeanin J{\"u}gler, Yves Laprie, Odile Mella, Bernd M{\"o}bius, Frank Zimmerer

The IFCASL corpus is a French-German bilingual phonetic learner corpus designed, recorded and annotated in a project on individualized feedback in computer-assisted spoken language learning.

Cannot find the paper you are looking for? You can Submit a new open access paper.