4421 articles – 2353 references  [version française]
HAL: inria-00566868, version 1

See detailed view  BibTeX,EndNote,...
Acoustics, Speech and Signal Processing, IEEE Conference on (ICASSP'11), Prague : Tchèque, République (2011)
An acoustically-motivated spatial prior for under-determined reverberant source separation
Ngoc Q. K. Duong 1, Emmanuel Vincent 1, Remi Gribonval 1
(2011-02-17)

We consider the task of under-determined reverberant audio source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random vector with full-rank spatial covariance matrix. We introduce an inverse Wishart prior over the covariance matrices, whose mean is given by the theory of statistical room acoustics and whose variance is learned from training data. We then derive an Expectation-Maximization (EM) algorithm to estimate the model parameters in the Maximum A Posteriori (MAP) sense given prior knowledge about the microphone spacing and the source positions. This algorithm provides a principled solution to the well-known permutation problem and achieves better separation performance than other algorithms exploiting the same prior knowledge.
1:  METISS (INRIA - IRISA)
CNRS : UMR6074 – INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – Université de Rennes 1
Statistics/Machine Learning
Under-determined convolutive source separation – Full-rank spatial covariance – Statistical room acoustics – Inverse-Wishart prior
Attached file list to this document: 
PDF
icassp2011.pdf(310.8 KB)