Music Source Separation

53 papers with code • 3 benchmarks • 7 datasets

Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.

( Image credit: SigSep )

Libraries

Use these libraries to find Music Source Separation models and implementations

A fully differentiable model for unsupervised singing voice separation

pierrechouteau/umss_icassp 30 Jan 2024

A novel model was recently proposed by Schulze-Forster et al. in [1] for unsupervised music source separation.

6
30 Jan 2024

Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models

danjacobellis/mpq 15 Jan 2024

Our results indicate three key findings: (1) using generative compression, it is feasible to leverage highly compressed data while incurring a negligible impact on machine perceptual quality; (2) machine perceptual quality correlates strongly with deep similarity metrics, indicating a crucial role of these metrics in the development of machine-oriented codecs; and (3) using lossy compressed datasets, (e. g. ImageNet) for pre-training can lead to counter-intuitive scenarios where lossy compression increases machine perceptual quality rather than degrading it.

1
15 Jan 2024

Pre-training Music Classification Models via Music Source Separation

FaceOnLive/Spleeter-Android-iOS 24 Oct 2023

In this paper, we study whether music source separation can be used as a pre-training strategy for music representation learning, targeted at music classification tasks.

190
24 Oct 2023

Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)

FaceOnLive/Spleeter-Android-iOS 15 Sep 2023

Music source separation (MSS) aims to extract 'vocals', 'drums', 'bass' and 'other' tracks from a piece of mixed music.

190
15 Sep 2023

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track

zfturbo/mvsep-mdx23-music-separation-model 14 Aug 2023

We propose a formalization of the errors that can occur in the design of a training dataset for MSS systems and introduce two new datasets that simulate such errors: SDXDB23_LabelNoise and SDXDB23_Bleeding.

483
14 Aug 2023

Sound Demixing Challenge 2023 Music Demixing Track Technical Report: TFC-TDF-UNet v3

kuielab/sdx23 15 Jun 2023

In this report, we present our award-winning solutions for the Music Demixing Track of Sound Demixing Challenge 2023.

56
15 Jun 2023

Quantifying Spatial Audio Quality Impairment

karnwatcharasupat/spauq 13 Jun 2023

Spatial audio quality is a highly multifaceted concept, with many interactions between environmental, geometrical, anatomical, psychological, and contextual considerations.

10
13 Jun 2023

The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation

asteroid-team/asteroid 13 May 2023

We modify the target network, i. e., the network architecture of the original DNN-based MSS, by adding bridging paths for each output instrument to share their information.

2,104
13 May 2023

Hybrid Transformers for Music Source Separation

facebookresearch/demucs 15 Nov 2022

While it performs poorly when trained only on MUSDB, we show that it outperforms Hybrid Demucs (trained on the same data) by 0. 45 dB of SDR when using 800 extra training songs.

7,635
15 Nov 2022

MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation

jeonchangbin49/medleyvox 14 Nov 2022

Second, to overcome the absence of existing multi-singing datasets for a training purpose, we present a strategy for construction of multiple singing mixtures using various single-singing datasets.

63
14 Nov 2022