Whispered and Lombard Neural Speech Synthesis
Qiong Hu
•
Tobias Bleisch
•
Petko Petkov
•
Tuomo Raitio
•
Erik Marchi
•
Varun Lakshminarasimhan
|
2021-01-13
|
Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech
Anonymous
|
2021-01-01
|
Parallel WaveNet conditioned on VAE latent vectors
Jonas Rohnke
•
Tom Merritt
•
Jaime Lorenzo-Trueba
•
Adam Gabrys
•
Vatsal Aggarwal
•
Alexis Moinet
•
Roberto Barra-Chicote
|
2020-12-17
|
Using previous acoustic context to improve Text-to-Speech synthesis
Pilar Oplustil-Gallegos
•
Simon King
|
2020-12-07
|
Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement
Hamed Hemati
•
Damian Borth
|
2020-11-12
|
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis
Ron J. Weiss
•
RJ Skerry-Ryan
•
Eric Battenberg
•
Soroosh Mariooryad
•
Diederik P. Kingma
|
2020-11-06
|
Grapheme or phoneme? An Analysis of Tacotron's Embedded Representations
Antoine Perquin
•
Erica Cooper
•
Junichi Yamagishi
|
2020-10-21
|
Learning Speaker Embedding from Text-to-Speech
|
Jaejin Cho
•
Piotr Zelasko
•
Jesus Villalba
•
Shinji Watanabe
•
Najim Dehak
|
2020-10-21
|
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Jonathan Shen
•
Ye Jia
•
Mike Chrzanowski
•
Yu Zhang
•
Isaac Elias
•
Heiga Zen
•
Yonghui Wu
|
2020-10-08
|
Controllable neural text-to-speech synthesis using intuitive prosodic features
Tuomo Raitio
•
Ramya Rasipuram
•
Dan Castellani
|
2020-09-14
|
Corrective feedback, emphatic speech synthesis, visual-speech exaggeration, pronunciation learning
Yaohua Bu
•
Weijun Li
•
Tianyi Ma
•
Shengqi Chen
•
Jia Jia
•
Kun Li
•
Xiaobo Lu
|
2020-09-12
|
Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion
|
Dipjyoti Paul
•
Muhammed PV Shifas
•
Yannis Pantazis
•
Yannis Stylianou
|
2020-08-13
|
Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS
Rui Liu
•
Berrak Sisman
•
Feilong Bao
•
Guanglai Gao
•
Haizhou Li
|
2020-08-11
|
SpeedySpeech: Efficient Neural Speech Synthesis
|
Jan Vainer
•
Ondřej Dušek
|
2020-08-09
|
One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech
|
Tomáš Nekvinda
•
Ondřej Dušek
|
2020-08-03
|
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis
Yusuke Yasuda
•
Xin Wang
•
Junichi Yamagishi
|
2020-05-20
|
End-To-End Speech Synthesis Applied to Brazilian Portuguese
|
Edresson Casanova
•
Arnaldo Candido Junior
•
Christopher Shulby
•
Frederico Santos de Oliveira
•
João Paulo Teixeira
•
Moacir Antonelli Ponti
•
Sandra Maria Aluisio
|
2020-05-11
|
ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders
Yu Gu
•
Xiang Yin
•
Yonghui Rao
•
Yuan Wan
•
Benlai Tang
•
Yang Zhang
•
Jitong Chen
•
Yuxuan Wang
•
Zejun Ma
|
2020-04-23
|
A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesis
Junjie Pan
•
Xiang Yin
•
Zhiling Zhang
•
Shichao Liu
•
Yang Zhang
•
Zejun Ma
•
Yuxuan Wang
|
2019-11-11
|
Speech Recognition with Augmented Synthesized Speech
Andrew Rosenberg
•
Yu Zhang
•
Bhuvana Ramabhadran
•
Ye Jia
•
Pedro Moreno
•
Yonghui Wu
•
Zelin Wu
|
2019-09-25
|
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
|
Yu Zhang
•
Ron J. Weiss
•
Heiga Zen
•
Yonghui Wu
•
Zhifeng Chen
•
RJ Skerry-Ryan
•
Ye Jia
•
Andrew Rosenberg
•
Bhuvana Ramabhadran
|
2019-07-09
|
A New GAN-based End-to-End TTS Training Algorithm
Haohan Guo
•
Frank K. Soong
•
Lei He
•
Lei Xie
|
2019-04-09
|
Taco-VC: A Single Speaker Tacotron based Voice Conversion with Limited Data
Roee Levy Leshem
•
Raja Giryes
|
2019-04-06
|
Multi-reference Tacotron by Intercross Training for Style Disentangling,Transfer and Control in Speech Synthesis
Yanyao Bian
•
Changbin Chen
•
Yongguo Kang
•
Zhenglin Pan
|
2019-04-04
|
Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet
Mingyang Zhang
•
Xin Wang
•
Fuming Fang
•
Haizhou Li
•
Junichi Yamagishi
|
2019-03-29
|
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language
|
Yusuke Yasuda
•
Xin Wang
•
Shinji Takaki
•
Junichi Yamagishi
|
2018-10-29
|
Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis
Yu-An Chung
•
Yuxuan Wang
•
Wei-Ning Hsu
•
Yu Zhang
•
RJ Skerry-Ryan
|
2018-08-30
|
Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis
Daisy Stanton
•
Yuxuan Wang
•
RJ Skerry-Ryan
|
2018-08-04
|
Voice Imitating Text-to-Speech Neural Networks
Young-Gun Lee
•
Taesu Kim
•
Soo-Young Lee
|
2018-06-04
|
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
|
RJ Skerry-Ryan
•
Eric Battenberg
•
Ying Xiao
•
Yuxuan Wang
•
Daisy Stanton
•
Joel Shor
•
Ron J. Weiss
•
Rob Clark
•
Rif A. Saurous
|
2018-03-24
|
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
|
Yuxuan Wang
•
Daisy Stanton
•
Yu Zhang
•
RJ Skerry-Ryan
•
Eric Battenberg
•
Joel Shor
•
Ying Xiao
•
Fei Ren
•
Ye Jia
•
Rif A. Saurous
|
2018-03-23
|
Emotional End-to-End Neural Speech Synthesizer
|
Young-Gun Lee
•
Azam Rabiee
•
Soo-Young Lee
|
2017-11-15
|
Uncovering Latent Style Factors for Expressive Speech Synthesis
Yuxuan Wang
•
RJ Skerry-Ryan
•
Ying Xiao
•
Daisy Stanton
•
Joel Shor
•
Eric Battenberg
•
Rob Clark
•
Rif A. Saurous
|
2017-11-01
|
Deep Voice 2: Multi-Speaker Neural Text-to-Speech
|
Sercan Arik
•
Gregory Diamos
•
Andrew Gibiansky
•
John Miller
•
Kainan Peng
•
Wei Ping
•
Jonathan Raiman
•
Yanqi Zhou
|
2017-05-24
|
Tacotron: Towards End-to-End Speech Synthesis
|
Yuxuan Wang
•
RJ Skerry-Ryan
•
Daisy Stanton
•
Yonghui Wu
•
Ron J. Weiss
•
Navdeep Jaitly
•
Zongheng Yang
•
Ying Xiao
•
Zhifeng Chen
•
Samy Bengio
•
Quoc Le
•
Yannis Agiomyrgiannakis
•
Rob Clark
•
Rif A. Saurous
|
2017-03-29
|