Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation

Matthieu Kowalski (1), Emmanuel Vincent (2), Rémi Gribonval (2)
(2) METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: We consider the problem of extracting the source signals from an under-determined convolutive mixture, assuming known mixing filters. State-of-the-art methods operate in the time-frequency domain and rely on a narrowband approximation of the convolutive mixing process by complex-valued multiplication in each frequency bin. The source signals are then estimated by minimizing either a mixture fitting cost or an l1 source sparsity cost, under possible constraints on the number of active sources. In this article, we define a wideband l2 mixture fitting cost that circumvents this approximation, and we investigate the use of an l1,2 mixed-norm cost promoting disjointness of the source time-frequency representations. We design a family of convex functionals combining these costs and derive suitable optimization algorithms. Experiments indicate that the proposed wideband methods improve the signal-to-distortion ratio by 2 to 5 dB over the state of the art on reverberant speech mixtures.
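To make the abstract's terminology concrete, the following is a sketch in assumed notation. None of the symbols below appear on this page, and the paper's own definitions may differ in indexing or scaling. It writes out the time-domain convolutive mixture, the narrowband approximation that replaces it in each frequency bin, and a generic wideband functional of the kind the abstract describes, with an l1 penalty and one common convention for the l1,2 mixed norm.

```latex
% Assumed notation (not from this page): I mixture channels x_i, J sources s_j
% with J > I (under-determined), known mixing filters a_{ij}, time frame n,
% frequency bin f, synthesis (inverse time-frequency) operator \Phi, and
% time-frequency coefficients \alpha_j of source j, so that s_j = \Phi \alpha_j.
\begin{align*}
  % Time-domain convolutive mixture
  x_i(t) &= \sum_{j=1}^{J} (a_{ij} * s_j)(t), \qquad i = 1, \dots, I \\
  % Narrowband approximation: convolution becomes a per-bin complex product
  \tilde{x}_i(n, f) &\approx \sum_{j=1}^{J} \hat{a}_{ij}(f)\, \tilde{s}_j(n, f) \\
  % Wideband functional: the l2 data fit is measured in the time domain,
  % so the convolution is exact; P is a convex penalty weighted by \lambda > 0
  \hat{\alpha} &= \arg\min_{\alpha} \; \frac{1}{2} \sum_{i=1}^{I}
      \Big\| x_i - \sum_{j=1}^{J} a_{ij} * (\Phi \alpha_j) \Big\|_2^2
      + \lambda\, P(\alpha) \\
  % l1 penalty: sparsity over all sources and bins
  P_{1}(\alpha) &= \sum_{j=1}^{J} \sum_{n, f} |\alpha_j(n, f)| \\
  % l1,2 mixed penalty (assumed convention, written squared):
  % l1 across sources within a bin, l2 across bins
  P_{1,2}(\alpha) &= \sum_{n, f} \Big( \sum_{j=1}^{J} |\alpha_j(n, f)| \Big)^{2}
\end{align*}
```

Under this assumed convention, the mixed penalty applies an l1 norm across sources inside each time-frequency bin, which favors few active sources per bin and is one way to read "promoting disjointness" in the abstract.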
Document type:
Journal article
IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2010, 18 (7), pp.1818 - 1829. 〈10.1109/TASL.2010.2050089〉

https://hal.archives-ouvertes.fr/hal-00435897
Contributor: Matthieu Kowalski
Submitted on: Saturday, 20 November 2010 - 16:40:43
Last modified on: Thursday, 21 March 2019 - 14:20:42
Document(s) archived on: Friday, 2 December 2016 - 15:49:04

File

kvg_taslp.pdf
Files produced by the author(s)

Citation

Matthieu Kowalski, Emmanuel Vincent, Rémi Gribonval. Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2010, 18 (7), pp.1818 - 1829. 〈10.1109/TASL.2010.2050089〉. 〈hal-00435897v3〉
