Under-determined reverberant audio source separation using a full-rank spatial covariance model

Ngoc Duong 1, Emmanuel Vincent 1, Rémi Gribonval 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : This article addresses the modeling of reverberant recording environments in the context of under-determined convolutive blind source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random variable whose covariance encodes the spatial characteristics of the source. We then consider four specific covariance models, including a full-rank unconstrained model. We derive a family of iterative expectation-maximization (EM) algorithms to estimate the parameters of each model, and we propose suitable procedures to initialize the parameters and to align the order of the estimated sources across all frequency bins based on their estimated directions of arrival (DOA). Experimental results over reverberant synthetic mixtures and live recordings of speech data show the effectiveness of the proposed approach.
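As an illustration of the Gaussian model described in the abstract, the sketch below estimates each source's contribution to the mixture channels at a single time-frequency bin with a multichannel Wiener filter, which is the minimum mean-square-error estimator when the contributions are independent, zero-mean and Gaussian. It is not taken from the report: the factorization of each covariance into a scalar variance `v[j]` times a spatial covariance matrix `R[j]`, the function name `wiener_separate`, and all array shapes are assumptions made for this example, and the parameters are taken as given rather than estimated by EM.

```python
import numpy as np


def wiener_separate(x, v, R):
    """Estimate source contributions at one time-frequency bin.

    Assumed (hypothetical) parameterization: the contribution of source j
    is zero-mean Gaussian with covariance v[j] * R[j].

    x: (I,) complex mixture vector over I channels.
    v: (J,) nonnegative source variances at this bin.
    R: (J, I, I) Hermitian spatial covariance matrices, one per source.
    Returns a (J, I) complex array of estimated contributions.
    """
    # Covariance of each source contribution: v_j * R_j, shape (J, I, I).
    source_cov = v[:, None, None] * R
    # Independent sources: the mixture covariance is the sum over sources.
    mix_cov = source_cov.sum(axis=0)
    # Solve mix_cov @ w = x instead of forming an explicit inverse.
    w = np.linalg.solve(mix_cov, x)
    # Wiener estimate for each source: (v_j * R_j) @ mix_cov^{-1} @ x.
    return source_cov @ w


# Under-determined toy case: J = 3 sources, I = 2 channels.
rng = np.random.default_rng(0)
I, J = 2, 3
A = rng.standard_normal((J, I, I)) + 1j * rng.standard_normal((J, I, I))
# A @ A^H plus a small ridge gives full-rank Hermitian covariances.
R = A @ A.conj().transpose(0, 2, 1) + 1e-3 * np.eye(I)
v = rng.uniform(0.5, 2.0, size=J)
x = rng.standard_normal(I) + 1j * rng.standard_normal(I)

c = wiener_separate(x, v, R)
print(c.shape)
```

With full-rank `R[j]`, the mixture covariance is invertible even though there are more sources than channels, and the estimated contributions sum back to the observed mixture vector `x`.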
Cited literature [23 references]

https://hal.inria.fr/inria-00435807
Contributor : Ngoc Duong
Submitted on : Monday, December 14, 2009 - 1:34:46 PM
Last modification on : Thursday, March 21, 2019 - 2:20:42 PM
Long-term archiving on : Thursday, September 23, 2010 - 11:24:56 AM

Files

rr-7116.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00435807, version 2
  • ARXIV : 0912.0171

Citation

Ngoc Duong, Emmanuel Vincent, Rémi Gribonval. Under-determined reverberant audio source separation using a full-rank spatial covariance model. [Research Report] INRIA. 2010. ⟨inria-00435807v2⟩

Metrics

  • Record views : 894
  • File downloads : 1031