Demucs: Deep Extractor for Music Sources with extra unlabeled data remixed

Alexandre Défossez; Nicolas Usunier; Léon Bottou; Francis Bach

Preprint, Working Paper. Year: 2019

Alexandre Défossez

Role: Author
PersonId : 15596
IdHAL : alexandre-defossez
ORCID : 0000-0003-3616-1968
IdRef : 260370533

Statistical Machine Learning and Parsimony

Université Paris Sciences et Lettres

Facebook AI Research [Paris]

Nicolas Usunier

Role: Author
PersonId : 933831

Facebook AI Research [Paris]

Léon Bottou

Role: Author
PersonId : 920968

Facebook AI Research [New York]

Francis Bach

Role: Author
PersonId : 863126

Université Paris Sciences et Lettres

Département d'informatique - ENS Paris

Statistical Machine Learning and Parsimony

Abstract

We study the problem of source separation for music using deep learning with four known sources: drums, bass, vocals and other accompaniments. State-of-the-art approaches predict soft masks over mixture spectrograms, while methods working on the waveform lag behind as measured on the standard MusDB benchmark. Our contribution is twofold. (i) We introduce a simple convolutional and recurrent model that outperforms the state-of-the-art waveform model, Wave-U-Net, by 1.6 points of SDR (signal-to-distortion ratio). (ii) We propose a new scheme to leverage unlabeled music. We first train a model to extract, from unlabeled tracks, parts in which at least one source is silent, for instance parts without bass. We then remix this extract with a bass line taken from the supervised dataset to form a new weakly supervised training example. Combining our architecture and scheme, we show that waveform methods can play in the same ballpark as spectrogram ones.
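The remixing step in (ii) can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `remix_example`, the mono list-of-floats waveform representation, and the choice to truncate both signals to the shorter length are all assumptions made for illustration. The sketch only shows how an extract with one silent source and a stem from the supervised dataset could be summed into a training pair where only the re-inserted source has a known ground truth.

```python
from typing import Dict, List, Tuple

# Assumption: a waveform is a mono list of float samples (real audio is
# usually multi-channel and stored as arrays).
Waveform = List[float]


def remix_example(
    extract: Waveform,
    silent_source: str,
    supervised_stems: Dict[str, Waveform],
) -> Tuple[Waveform, Dict[str, Waveform]]:
    """Build one weakly supervised example (hypothetical helper).

    extract: audio taken from an unlabeled track in which `silent_source`
        is believed to be silent (as predicted by a first model).
    supervised_stems: isolated sources from the labeled dataset.
    """
    stem = supervised_stems[silent_source]
    # Assumption: truncate both signals to the shorter length before mixing.
    length = min(len(extract), len(stem))
    stem = stem[:length]
    mixture = [e + s for e, s in zip(extract[:length], stem)]
    # Only the re-inserted source has a known target, hence "weak" supervision.
    return mixture, {silent_source: stem}


mixture, target = remix_example(
    extract=[0.5, -0.25, 0.0, 1.0],
    silent_source="bass",
    supervised_stems={"bass": [0.25, 0.25, -0.5]},
)
```

In this sketch the targets for the other sources remain unknown, which is why the resulting example is only weakly supervised.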

Domains

Sound [cs.SD]; Machine Learning [cs.LG]; Machine Learning [stat.ML]

Main file

demucs_preprint.pdf (518.05 KB)

Origin: Files produced by the author(s)

Contributor: Alexandre Defossez

https://hal.science/hal-02277338

Submitted on: Tuesday, September 3, 2019, 15:12:17

Last modified on: Friday, April 19, 2024, 16:18:56

Long-term archiving on: Wednesday, February 5, 2020, 22:33:55

Dates and versions

hal-02277338, version 1 (03-09-2019)

Identifiers

HAL Id: hal-02277338, version 1
arXiv: 1909.01174

Cite

Alexandre Défossez, Nicolas Usunier, Léon Bottou, Francis Bach. Demucs: Deep Extractor for Music Sources with extra unlabeled data remixed. 2019. ⟨hal-02277338⟩

Collections

ENS-PARIS CNRS INRIA INRIA2 PSL

1060 views

1210 downloads