Speech

Expressive Speech Synthesis

11 papers with code • 0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Expressive Speech Synthesis

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Latest papers

Most implemented Social Latest No code

DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training

hsoh0306/diffprosody • • 31 Jul 2023

Expressive text-to-speech systems have undergone significant advancements owing to prosody modeling, but conventional methods can still be improved.

31 Jul 2023

Paper
Code

SC VALL-E: Style-Controllable Zero-Shot Text to Speech Synthesizer

0913ktg/sc_vall-e • • 20 Jul 2023

Expressive speech synthesis models are trained by adding corpora with diverse speakers, various emotions, and different speaking styles to the dataset, in order to control various characteristics of speech and generate the desired voice.

129

20 Jul 2023

Paper
Code

EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels

knoriy/emns-dct • 22 May 2023

The increasing adoption of text-to-speech technologies has led to a growing demand for natural and emotive voices that adapt to a conversation's context and emotional tone.

22 May 2023

Paper
Code

Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert

serp-ai/bark-with-voice-clone • • Social Science Research Network (SSRN) 2023

Keywords: Bark, ai voice cloning, Suno, text-to-speech, artificial intelligence, audio generation, Meta's encodec, audio codebooks, semantic tokens, HuBert, transformer-based model, multilingual speech, wav2vec, linear projection head, embedding space, generative capabilities, pretrained model checkpoints

2,805

18 Apr 2023

Paper
Code

Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

keonlee9420/Cross-Speaker-Emotion-Transfer • • 8 Oct 2021

In expressive speech synthesis, there are high requirements for emotion interpretation.

171

08 Oct 2021

Paper
Code

Laughter Synthesis: Combining Seq2seq modeling with Transfer Learning

numediart/LaughterSynthesis • 20 Aug 2020

Despite the growing interest for expressive speech synthesis, synthesis of nonverbal expressions is an under-explored area.

20 Aug 2020

Paper
Code

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

jasminsternkopf/mel_cepstral_distance • 8 Jun 2019

Recent work has explored sequence-to-sequence latent variable models for expressive speech synthesis (supporting control and transfer of prosody and style), but has not presented a coherent framework for understanding the trade-offs between the competing methods.

08 Jun 2019

Paper
Code