TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Enhancement	VoiceBank + DEMAND	MetricGAN+	PESQ	3.15	# 12
Speech Enhancement	VoiceBank + DEMAND	MetricGAN+	CSIG	4.14	# 18
Speech Enhancement	VoiceBank + DEMAND	MetricGAN+	CBAK	3.16	# 18
Speech Enhancement	VoiceBank + DEMAND	MetricGAN+	COVL	3.64	# 15

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement

8 Apr 2021 · Szu-Wei Fu, Cheng Yu, Tsun-An Hsieh, Peter Plantinga, Mirco Ravanelli, Xugang Lu, Yu Tsao

The discrepancy between the cost function used to train a speech enhancement model and human auditory perception usually makes the quality of enhanced speech unsatisfactory. Objective evaluation metrics that account for human perception can therefore serve as a bridge to reduce this gap. Our previously proposed MetricGAN was designed to optimize objective metrics by connecting the metric with a discriminator. Because only the scores of the target evaluation functions are needed during training, the metrics can even be non-differentiable. In this study, we propose MetricGAN+, which introduces three training techniques that incorporate domain knowledge of speech processing. With these techniques, experimental results on the VoiceBank-DEMAND dataset show that MetricGAN+ increases the PESQ score by 0.3 compared to the previous MetricGAN and achieves state-of-the-art results (PESQ score = 3.15).
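
The idea of connecting a metric with a discriminator can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy metric, and the squared-error form of the losses are assumptions made here for illustration, and the three MetricGAN+ training techniques are not shown. The sketch only shows why the metric itself never needs a gradient: the discriminator regresses the metric's scores, and the generator is trained against the discriminator's output.

```python
import numpy as np


def toy_metric(enhanced, clean):
    """Hypothetical stand-in for a normalized quality metric in [0, 1].

    The rounding makes it piecewise constant, so it gives no useful
    gradient, much like a black-box metric such as PESQ.
    """
    err = np.round(np.abs(enhanced - clean), 1)
    return float(np.clip(1.0 - err.mean(), 0.0, 1.0))


def discriminator_loss(d_enhanced, d_clean, metric_score):
    """Train D to predict the metric score (assumed squared-error form).

    d_enhanced:   D's output for the (enhanced, clean) pair
    d_clean:      D's output for the (clean, clean) pair, whose score is 1
    metric_score: the true normalized score for the enhanced signal
    """
    return (d_enhanced - metric_score) ** 2 + (d_clean - 1.0) ** 2


def generator_loss(d_enhanced, target=1.0):
    """Train G so that D's predicted score approaches the target score."""
    return (d_enhanced - target) ** 2


# Only the scalar score is passed to the loss; the metric is never differentiated.
clean = np.zeros(8)
enhanced = clean + 0.26
score = toy_metric(enhanced, clean)
loss_d = discriminator_loss(d_enhanced=0.5, d_clean=0.9, metric_score=score)
loss_g = generator_loss(d_enhanced=0.5)
```

In a real system, the discriminator and generator would be neural networks and the discriminator outputs would come from forward passes rather than fixed numbers; the values above are placeholders.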
