Speech Dereverberation

16 papers with code • 4 benchmarks • 5 datasets

Removing reverberation from audio signals

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

sp-uhh/sgmse 22 Dec 2022

As diffusion models are generative approaches they may also produce vocalizing and breathing artifacts in adverse conditions.

383
22 Dec 2022

DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement

donghoney0416/DeFT-AN 15 Dec 2022

In this study, we propose a dense frequency-time attentive network (DeFT-AN) for multichannel speech enhancement.

17
15 Dec 2022

Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration

sp-uhh/sgmse 4 Nov 2022

In this paper, we systematically compare the performance of generative diffusion models and discriminative approaches on different speech restoration tasks.

383
04 Nov 2022

Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation

jwr1995/dtcn 27 Oct 2022

In this work deformable convolution is proposed as a solution to allow TCN models to have dynamic RFs that can adapt to various reverberation times for reverberant speech separation.

15
27 Oct 2022

Speech Dereverberation with a Reverberation Time Shortening Target

Audio-WestlakeU/FullSubNet 20 Oct 2022

The proposed RTS target suppresses reverberation and meanwhile maintains the exponential decaying property of reverberation, which will ease the network training, and thus reduce signal distortion caused by the prediction error.

502
20 Oct 2022

Speech Enhancement and Dereverberation with Diffusion-based Generative Models

sp-uhh/sgmse IEEE/ACM Transactions on Audio, Speech, and Language Processing 2023

This matches our forward process which moves from clean speech to noisy speech by including a drift term.

383
11 Aug 2022

MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes

anton-jeran/FAST-RIR 18 May 2022

We show that the acoustic metrics of the IRs predicted from our MESH2IR match the ground truth with less than 10% error.

137
18 May 2022

Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation

jwr1995/wd-tcn 17 May 2022

It is shown that this weighted multi-dilation temporal convolutional network (WD-TCN) consistently outperforms the TCN across various model configurations and using the WD-TCN model is a more parameter efficient method to improve the performance of the model than increasing the number of convolutional blocks.

8
17 May 2022

Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation

jwr1995/whamr_ext 13 Apr 2022

A feature of TCNs is that they have a receptive field (RF) dependent on the specific model configuration which determines the number of input frames that can be observed to produce an individual output frame.

5
13 Apr 2022