Open-Unmix - A Reference Implementation for Music Source Separation
Music source separation is the task of decomposing music into its constitutive components,e.g., yielding separated stems for the vocals, bass, and drums. Such a separation has manyapplications ranging from rearranging/repurposing the stems (remixing, repanning, upmixing)to full extraction (karaoke, sample creation, audio restoration). Music separation has a longhistory of scientific activity as it is known to be a very challenging problem. In recent years,deep learning-based systems - for the first time - yielded high-quality separations that alsolead to increased commercial interest. However, until now, no open-source implementationthat achieves state-of-the-art results is available.Open-Unmixcloses this gap by providinga reference implementation based on deep neural networks. It serves two main purposes.Firstly, to accelerate academic research asOpen-Unmixprovides implementations for themost popular deep learning frameworks, giving researchers a flexible way to reproduce results.Secondly, we provide a pre-trained model for end users and even artists to try and use sourceseparation. Furthermore, we designedOpen-Unmixto be one core component in an openecosystem on music separation, where we already provide open datasets, software utilities,and open evaluation to foster reproducible research as the basis of future development.
PDF AbstractCode
Tasks
Datasets
Results from the Paper
Ranked #15 on Music Source Separation on MUSDB18 (using extra training data)
Task | Dataset | Model | Metric Name | Metric Value | Global Rank | Uses Extra Training Data |
Benchmark |
---|---|---|---|---|---|---|---|
Music Source Separation | MUSDB18 | UMXL | SDR (vocals) | 7.213 | # 15 | ||
SDR (drums) | 7.148 | # 12 | |||||
SDR (other) | 4.889 | # 12 | |||||
SDR (bass) | 6.015 | # 14 | |||||
SDR (avg) | 6.316 | # 15 | |||||
Music Source Separation | MUSDB18 | UMX | SDR (vocals) | 6.32 | # 25 | ||
SDR (drums) | 5.73 | # 24 | |||||
SDR (other) | 4.02 | # 23 | |||||
SDR (bass) | 5.23 | # 25 | |||||
SDR (avg) | 5.33 | # 25 |