TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Separation	WHAMR!	Wavesplit	SI-SDRi	13.2	#6
Speech Separation	WSJ0-2mix	Wavesplit v2	SI-SDRi	22.2	#9
Speech Separation	WSJ0-2mix	Wavesplit v2	SDRi	22.3	#2
Speech Separation	WSJ0-2mix	Wavesplit v1	SI-SDRi	19.0	#18

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavesplit-end-to-end-speech-separation-by/speech-separation-on-whamr)](https://paperswithcode.com/sota/speech-separation-on-whamr?p=wavesplit-end-to-end-speech-separation-by)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavesplit-end-to-end-speech-separation-by/speech-separation-on-wsj0-2mix)](https://paperswithcode.com/sota/speech-separation-on-wsj0-2mix?p=wavesplit-end-to-end-speech-separation-by)`

Wavesplit: End-to-End Speech Separation by Speaker Clustering

20 Feb 2020 · Neil Zeghidour, David Grangier

We introduce Wavesplit, an end-to-end source separation system. From a single mixture, the model infers a representation for each source and then estimates each source signal given the inferred representations. The model is trained to jointly perform both tasks from the raw waveform. Wavesplit infers a set of source representations via clustering, which addresses the fundamental permutation problem of separation. For speech separation, our sequence-wide speaker representations provide a more robust separation of long, challenging recordings compared to prior work. Wavesplit redefines the state of the art on clean mixtures of 2 or 3 speakers (WSJ0-2/3mix), as well as in noisy and reverberated settings (WHAM!/WHAMR!). We also set a new benchmark on the recent LibriMix dataset. Finally, we show that Wavesplit is also applicable to other domains, by separating fetal and maternal heart rates from a single abdominal electrocardiogram.
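To make the permutation problem and the SI-SDRi metric in the results table concrete, here is a minimal sketch in plain Python. It is not Wavesplit's method: the abstract says Wavesplit handles the permutation problem through clustering, whereas this sketch uses the common brute-force search over output orderings, which is an assumption about how such scores are typically evaluated. The function names (`si_sdr`, `best_permutation_si_sdr`, `si_sdr_improvement`) are hypothetical and chosen only for illustration.

```python
import itertools
import math


def si_sdr(estimate, reference):
    """Scale-invariant SDR in dB between two equal-length sample sequences."""
    eps = 1e-12  # avoids log of zero for a perfect estimate
    dot = sum(e * r for e, r in zip(estimate, reference))
    ref_energy = sum(r * r for r in reference)
    scale = dot / (ref_energy + eps)
    # Project the estimate onto the reference; the remainder is treated as noise.
    target = [scale * r for r in reference]
    noise = [e - t for e, t in zip(estimate, target)]
    target_energy = sum(t * t for t in target)
    noise_energy = sum(n * n for n in noise)
    return 10.0 * math.log10((target_energy + eps) / (noise_energy + eps))


def best_permutation_si_sdr(estimates, references):
    """Try every assignment of estimates to references and keep the best mean SI-SDR.

    In the returned permutation, perm[i] is the reference index matched to estimate i.
    """
    best_score, best_perm = -math.inf, None
    for perm in itertools.permutations(range(len(references))):
        score = sum(
            si_sdr(estimates[i], references[p]) for i, p in enumerate(perm)
        ) / len(perm)
        if score > best_score:
            best_score, best_perm = score, perm
    return best_score, best_perm


def si_sdr_improvement(estimates, references, mixture):
    """SI-SDRi: best-permutation SI-SDR minus the SI-SDR of the unprocessed mixture."""
    separated, perm = best_permutation_si_sdr(estimates, references)
    baseline = sum(si_sdr(mixture, r) for r in references) / len(references)
    return separated - baseline, perm


if __name__ == "__main__":
    # Two toy orthogonal "sources" and their mixture.
    s1 = [1.0, 0.0, -1.0, 0.0, 1.0, 0.0, -1.0, 0.0]
    s2 = [0.0, 1.0, 0.0, -1.0, 0.0, 1.0, 0.0, -1.0]
    mix = [a + b for a, b in zip(s1, s2)]
    # Estimates come out in swapped order, each with a little leakage.
    est = [
        [0.05 * a + 0.9 * b for a, b in zip(s1, s2)],
        [0.9 * a + 0.05 * b for a, b in zip(s1, s2)],
    ]
    gain, perm = si_sdr_improvement(est, [s1, s2], mix)
    print(f"SI-SDRi: {gain:.1f} dB with assignment {perm}")
```

The search over permutations grows factorially with the number of sources, which is manageable for the 2- and 3-speaker mixtures mentioned in the abstract but is one reason to look for alternatives to ordering-based matching.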

Code

No code implementations yet.

Tasks

Clustering

Data Augmentation

Speech Separation

Datasets

WSJ0-2mix, WHAM!, LibriMix, WHAMR!

Results from the Paper

Ranked #6 on Speech Separation on WHAMR!

Methods

No methods listed for this paper.
