Under-determined reverberant audio source separation using a full-rank spatial covariance model

Ngoc Q. K. Duong; Emmanuel Vincent; Rémi Gribonval

Report (Research Report) Year: 2010


Ngoc Q. K. Duong

Role: Author
PersonId : 864978

Speech and sound data modeling and processing

Emmanuel Vincent

Role: Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech and sound data modeling and processing

Rémi Gribonval

Role: Author
PersonId : 1255
IdHAL : remi-gribonval
ORCID : 0000-0002-9450-8125
IdRef : 113181590

Speech and sound data modeling and processing

Abstract

This article addresses the modeling of reverberant recording environments in the context of under-determined convolutive blind source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random variable whose covariance encodes the spatial characteristics of the source. We then consider four specific covariance models, including a full-rank unconstrained model. We derive a family of iterative expectation-maximization (EM) algorithms to estimate the parameters of each model and propose suitable procedures to initialize the parameters and to align the order of the estimated sources across all frequency bins based on their estimated directions of arrival (DOA). Experimental results over reverberant synthetic mixtures and live recordings of speech data show the effectiveness of the proposed approach.
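As a rough illustration of the Gaussian model described in the abstract, the sketch below computes the posterior-mean (multichannel Wiener) estimate of each source's contribution at a single time-frequency bin. It is not the EM algorithm from the report: the parameters are taken as known. It also assumes that each source covariance factors into a scalar variance `v[j]` times a spatial covariance matrix `R[j]`. The function name `wiener_separate` and all variable names are hypothetical.

```python
import numpy as np

def wiener_separate(x, v, R):
    """Posterior-mean estimate of each source's contribution to the mixture.

    x: (I,) complex mixture coefficients at one time-frequency bin
    v: (J,) nonnegative source variances (assumed factorization)
    R: (J, I, I) Hermitian positive semi-definite spatial covariances
    Returns an array of shape (J, I): one estimated contribution per source.
    """
    # Covariance of each source contribution: v_j * R_j (assumption)
    sigma_c = v[:, None, None] * R
    # The sources are assumed independent, so the mixture covariance is the sum
    sigma_x = sigma_c.sum(axis=0)
    # Solve sigma_x @ y = x instead of forming the inverse explicitly
    y = np.linalg.solve(sigma_x, x)
    # Wiener estimate: sigma_c[j] @ sigma_x^{-1} @ x for every source j
    return sigma_c @ y

# Toy under-determined case: J = 3 sources, I = 2 channels (made-up data)
rng = np.random.default_rng(0)
I, J = 2, 3
A = rng.standard_normal((J, I, I)) + 1j * rng.standard_normal((J, I, I))
R = A @ A.conj().transpose(0, 2, 1)   # full-rank Hermitian matrices
v = rng.uniform(0.5, 2.0, size=J)
x = rng.standard_normal(I) + 1j * rng.standard_normal(I)

c_hat = wiener_separate(x, v, R)
print(c_hat.shape)
```

By construction the estimated contributions add up to the observed mixture, which is a convenient sanity check. With a full-rank `R[j]`, each source can occupy more than one spatial direction, which is what lets the filter remain usable with more sources than channels.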

Keywords

Convolutive blind source separation; underdetermined mixtures; spatial covariance models; EM algorithm; permutation problem

Domains

Machine Learning [stat.ML]; Signal and Image Processing [eess.SP]

Main file

rr-7116.pdf (451.81 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/inria-00435807

Submitted on: Monday, 14 December 2009, 13:34:46

Last modified on: Friday, 24 March 2023, 14:52:52

Long-term archiving on: Thursday, 23 September 2010, 11:24:56

Dates and versions

inria-00435807 , version 1 (25-11-2009)

inria-00435807 , version 2 (14-12-2009)

Identifiers

HAL Id : inria-00435807 , version 2
ARXIV : 0912.0171

Cite

Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval. Under-determined reverberant audio source separation using a full-rank spatial covariance model. [Research Report] INRIA. 2010. ⟨inria-00435807v2⟩

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA INRIA-RRRT IRISA-D5 INRIA2 LARA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

604 views

1821 downloads
