Search Results for author: Thomas Arildsen

Found 3 papers, 1 paper with code

Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder

no code implementations • 15 Nov 2022 • Yuying Xie, Thomas Arildsen, Zheng-Hua Tan

For the prior of the speaker identity variable, FHVAE assumes a Gaussian distribution with an utterance-scale varying mean and a fixed variance.

Contrastive Learning, Disentanglement (+4 more)

Complex Recurrent Variational Autoencoder with Application to Speech Enhancement

1 code implementation • 5 Apr 2022 • Yuying Xie, Thomas Arildsen, Zheng-Hua Tan

This work proposes a complex recurrent VAE framework in which a complex-valued recurrent neural network and an L1 reconstruction loss are used.

Speech Enhancement

Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective

no code implementations • 5 Apr 2022 • Yuying Xie, Thomas Arildsen, Zheng-Hua Tan

Autoregressive predictive coding (APC), a self-supervised objective, has been used to extract meaningful and transferable speech features for multiple downstream tasks.

Disentanglement, Speaker Recognition (+3 more)
