| HAL : hal-00717366, version 1 |
|
| ISMIR 2012: 13th International Society for Music Information Retrieval Conference, Porto, Portugal (2012) |
|
|
|
|
| Semi-supervised NMF with time-frequency annotations for single-channel source separation |
|
|
| Augustin Lefèvre 1, Francis Bach 1, 2 |
|
|
| (08/10/2012) |
|
|
| We formulate a novel extension of nonnegative matrix factorization (NMF) to take into account partial information on source-specific activity in the spectrogram. This information comes in the form of masking coefficients, such as those found in an ideal binary mask. We show that state-of-the-art results in source separation may be achieved with only a limited amount of correct annotation, and furthermore that our algorithm is robust to incorrect annotations. Since in practice ideal annotations are not observed, we propose several supervision scenarios to estimate the ideal masking coefficients. First, manual annotations by a trained user on a dedicated graphical user interface are shown to provide satisfactory performance, although they are prone to errors. Second, we investigate simple learning strategies to predict the Wiener coefficients based on local information around a given time-frequency bin of the spectrogram. Results on single-channel source separation show that time-frequency annotations make it possible to disambiguate the source separation problem, and that learned annotations open the way for a completely unsupervised learning procedure for source separation with no human intervention. |
|
| 1 : | SIERRA (INRIA Paris - Rocquencourt) |
| INRIA : PARIS - ROCQUENCOURT – Ecole normale supérieure de Paris - ENS Paris – CNRS : UMR8548 | |
| 2 : | Laboratoire d'informatique de l'école normale supérieure (LIENS) |
| CNRS : UMR8548 – Ecole normale supérieure de Paris - ENS Paris | |
| 3 : | Laboratoire Traitement et Communication de l'Information [Paris] (LTCI) |
| Télécom ParisTech – CNRS : UMR5141 | |
|
| Domain | : | Mathematics/Statistics, Statistics/Theory |
|
|
| nonnegative matrix factorization – inpainting – matrix completion – single channel source separation – blind source separation – unsupervised learning |
|
|
| http://hal.archives-ouvertes.fr/hal-00717366 | |
| oai:hal.archives-ouvertes.fr:hal-00717366 | |
| Contributor: Augustin Lefèvre | |
| Submitted on: Thursday 12 July 2012, 16:45:21 | |
| Last modified on: Monday 16 July 2012, 13:29:36 | |