Speech Enhancement
218 papers with code • 12 benchmarks • 19 datasets
Speech Enhancement is a signal processing task that improves the quality of speech signals captured under noisy or degraded conditions. The goal is to make speech clearer, more intelligible, and more pleasant to listen to, which benefits applications such as speech recognition, teleconferencing, and hearing aids.
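A classical baseline for this task is spectral subtraction: estimate the noise spectrum, subtract it from each frame's magnitude, and resynthesize. The sketch below is a minimal illustration, not any paper's method; the function name and the assumption that the first half-second of the recording is speech-free are both hypothetical.

```python
import numpy as np
from scipy.signal import stft, istft

def spectral_subtract(noisy, fs, noise_seconds=0.5):
    """Basic spectral subtraction: estimate the noise magnitude from an
    assumed speech-free lead-in, subtract it per frame, resynthesize."""
    _, _, Z = stft(noisy, fs=fs, nperseg=512)  # hop = nperseg // 2 = 256
    mag, phase = np.abs(Z), np.angle(Z)
    # Illustrative assumption: the first `noise_seconds` contain noise only.
    n_frames = max(1, int(noise_seconds * fs / 256))
    noise_mag = mag[:, :n_frames].mean(axis=1, keepdims=True)
    # Subtract the noise estimate, keeping a small spectral floor to
    # limit "musical noise" artifacts.
    clean_mag = np.maximum(mag - noise_mag, 0.05 * noise_mag)
    _, enhanced = istft(clean_mag * np.exp(1j * phase), fs=fs, nperseg=512)
    return enhanced
```

Note that the noisy phase is reused unchanged here; several of the papers below (e.g. explicit phase estimation, diffusion models) exist precisely because this is a weak point of magnitude-only methods.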
( Image credit: A Fully Convolutional Neural Network For Speech Enhancement )
Libraries
Use these libraries to find Speech Enhancement models and implementations
Datasets
Subtasks
Latest papers
ICASSP 2023 Acoustic Echo Cancellation Challenge
This is the fourth AEC challenge. It is enhanced by adding a second track for personalized acoustic echo cancellation, reducing the algorithmic-plus-buffering latency to 20 ms, and including a full-band version of AECMOS.
Unsupervised speech enhancement with diffusion-based generative models
To address this issue, we introduce an alternative approach that operates in an unsupervised manner, leveraging the generative power of diffusion models.
Single and Few-step Diffusion for Generative Speech Enhancement
While the performance of typical generative diffusion algorithms drops dramatically when the number of function evaluations (NFEs) is lowered to obtain single-step diffusion, we show that our proposed method maintains steady performance. It therefore largely outperforms the diffusion baseline in this setting and also generalizes better than its predictive counterpart.
Multi-dimensional Speech Quality Assessment in Crowdsourcing
The commonly used standard ITU-T Rec.
Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models
Diff-SV unifies a DPM-based speech enhancement system with a speaker embedding extractor, and yields a discriminative and noise-tolerant speaker representation through a hierarchical structure.
Gray Jedi MVDR Post-filtering
Spatial filters can exploit deep-learning-based speech enhancement models to increase their reliability in scenarios with multiple speech sources.
Simulating room transfer functions between transducers mounted on audio devices using a modified image source method
The image source method (ISM) is often used to simulate room acoustics due to its ease of use and computational efficiency.
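The ISM models each wall reflection as a mirrored copy of the source; the delays and attenuations of those image sources form the room impulse response. A minimal first-order sketch for a shoebox room is below; the function name is hypothetical and wall absorption is ignored for brevity.

```python
import numpy as np

def ism_first_order(room, src, mic, c=343.0):
    """First-order image source method for a shoebox room: mirror the
    source across each of the six walls and return (delay_s, 1/r_gain)
    pairs for the direct path plus six first-order reflections."""
    L = np.array(room, float)
    mic = np.array(mic, float)
    images = [np.array(src, float)]  # direct path first
    for axis in range(3):
        for wall in (0.0, L[axis]):
            img = np.array(src, float)
            img[axis] = 2.0 * wall - img[axis]  # mirror across the wall plane
            images.append(img)
    out = []
    for img in images:
        r = np.linalg.norm(img - mic)          # propagation distance
        out.append((r / c, 1.0 / max(r, 1e-9)))  # delay and spherical-spreading gain
    return out
```

Higher-order reflections are generated by mirroring the images recursively, which is where the method's computational cost grows; full implementations also apply frequency-dependent wall absorption per reflection.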
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Compared to existing phase-aware speech enhancement methods, it further mitigates the compensation effect between the magnitude and phase by explicit phase estimation, elevating the perceptual quality of enhanced speech.
Separate Anything You Describe
In this work, we introduce AudioSep, a foundation model for open-domain audio source separation with natural language queries.
The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions
In this work, SE models are trained and tested on several different languages, using self-supervised representations as loss-function representations; these representations are themselves trained on different language combinations and with differing network structures.