Robust Speech Recognition

22 papers with code • 0 benchmarks • 3 datasets

This task has no description! Would you like to contribute one?

Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition

yuchen005/dpsl-asr 28 Mar 2022

Then, we propose style learning to map the fused feature close to clean feature, in order to learn latent speech information from the latter, i. e., clean "speech style".

34
28 Mar 2022

Speech-enhanced and Noise-aware Networks for Robust Speech Recognition

sinica-slam/kaldi-senan 25 Mar 2022

In this paper, a noise-aware training framework based on two cascaded neural structures is proposed to jointly optimize speech enhancement and speech recognition.

6
25 Mar 2022

Sequential Randomized Smoothing for Adversarially Robust Speech Recognition

raphaelolivier/smoothingasr EMNLP 2021

We apply adaptive versions of state-of-the-art attacks, such as the Imperceptible ASR attack, to our model, and show that our strongest defense is robust to all attacks that use inaudible noise, and can only be broken with very high distortion.

2
05 Nov 2021

Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition

yuchen005/dpsl-asr 11 Oct 2021

Speech enhancement (SE) aims to suppress the additive noise from a noisy speech signal to improve the speech's perceptual quality and intelligibility.

34
11 Oct 2021

An Investigation of End-to-End Models for Robust Speech Recognition

archiki/Robust-E2E-ASR 11 Feb 2021

A systematic comparison of these two approaches for end-to-end robust ASR has not been attempted before.

44
11 Feb 2021

Domain Adaptation Using Class Similarity for Robust Speech Recognition

zhu-han/ASR-Adaption-Class-Similarity 5 Nov 2020

Then, for each class, probabilities of this class are used to compute a mean vector, which we refer to as mean soft labels.

9
05 Nov 2020

Multi-task self-supervised learning for Robust Speech Recognition

santi-pdp/pase 25 Jan 2020

We then propose a revised encoder that better learns short- and long-term speech dynamics with an efficient combination of recurrent and convolutional networks.

436
25 Jan 2020

Learning Waveform-Based Acoustic Models using Deep Variational Convolutional Neural Networks

doglic/asr 23 Jun 2019

We investigate the potential of stochastic neural networks for learning effective waveform-based acoustic models.

0
23 Jun 2019

Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition

vivivic/speech-domain-adaptation-DRL 12 Apr 2019

The latent variables allow us to convert the domain of speech according to its context and domain representation.

16
12 Apr 2019

Scalable Factorized Hierarchical Variational Autoencoder Training

wnhsu/ScalableFHVAE 9 Apr 2018

Deep generative models have achieved great success in unsupervised learning with the ability to capture complex nonlinear relationships between latent generating factors and observations.

52
09 Apr 2018