Skip to Main content Skip to Navigation
Conference papers

Semi-supervised {NMF} with time-frequency annotations for single-channel source separation

Augustin Lefèvre 1 Francis Bach 1, 2 Cédric Févotte 3
1 SIERRA - Statistical Machine Learning and Parsimony
DI-ENS - Département d'informatique de l'École normale supérieure, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
Abstract : We formulate a novel extension of nonnegative matrix factorization (NMF) to take into account partial information on source-specific activity in the spectrogram. This information comes in the form of masking coefficients, such as those found in an ideal binary mask. We show that state-of-the-art results in source separation may be achieved with only a limited amount of correct annotation, and furthermore our algorithm is robust to incorrect annotations. Since in practice ideal annotations are not observed, we propose several supervision scenarios to estimate the ideal mask- ing coefficients. First, manual annotations by a trained user on a dedicated graphical user interface are shown to provide satisfactory performance although they are prone to errors. Second, we investigate simple learning strate- gies to predict the Wiener coefficients based on local information around a given time-frequency bin of the spec- trogram. Results on single-channel source separation show that time-frequency annotations allow to disambiguate the source separation problem, and learned annotations open the way for a completely unsupervised learning procedure for source separation with no human intervention.
Complete list of metadata

Cited literature [9 references]  Display  Hide  Download
Contributor : Augustin Lefèvre <>
Submitted on : Thursday, July 12, 2012 - 4:45:21 PM
Last modification on : Tuesday, May 4, 2021 - 2:06:02 PM
Long-term archiving on: : Saturday, October 13, 2012 - 2:35:51 AM


Files produced by the author(s)


  • HAL Id : hal-00717366, version 1


Augustin Lefèvre, Francis Bach, Cédric Févotte. Semi-supervised {NMF} with time-frequency annotations for single-channel source separation. ISMIR 2012 : 13th International Society for Music Information Retrieval Conference, Oct 2012, Porto, Portugal. ⟨hal-00717366⟩



Record views


Files downloads