Search Results for author: Vatsal Aggarwal

Found 4 papers, 1 paper with code

Parallel WaveNet conditioned on VAE latent vectors

no code implementations • 17 Dec 2020 • Jonas Rohnke, Tom Merritt, Jaime Lorenzo-Trueba, Adam Gabrys, Vatsal Aggarwal, Alexis Moinet, Roberto Barra-Chicote

In this paper we investigate the use of a sentence-level conditioning vector to improve the signal quality of a Parallel WaveNet neural vocoder.

Sentence, Speech Synthesis, +1

BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization

no code implementations • 4 Feb 2020 • Henry B. Moss, Vatsal Aggarwal, Nishant Prateek, Javier González, Roberto Barra-Chicote

We present BOFFIN TTS (Bayesian Optimization For FIne-tuning Neural Text To Speech), a novel approach for few-shot speaker adaptation.

Bayesian Optimization

Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech

no code implementations • 28 Nov 2019 • Vatsal Aggarwal, Marius Cotescu, Nishant Prateek, Jaime Lorenzo-Trueba, Roberto Barra-Chicote

We propose a Text-to-Speech method to create an unseen expressive style using one utterance of expressive speech of around one second.

Disentanglement, Expressive Speech Synthesis, +1

Towards achieving robust universal neural vocoding

1 code implementation • 4 Jul 2019 • Jaime Lorenzo-Trueba, Thomas Drugman, Javier Latorre, Thomas Merritt, Bartosz Putrycz, Roberto Barra-Chicote, Alexis Moinet, Vatsal Aggarwal

This vocoder is shown to be capable of generating speech of consistently good quality (98% relative mean MUSHRA when compared to natural speech) regardless of whether the input spectrogram comes from a speaker or style seen during training or from an out-of-domain scenario when the recording conditions are studio-quality.

