| HAL: inria-00566868, version 1 |
| DOI: 10.1109/ICASSP.2011.5946315 |
| See detailed view | BibTeX,EndNote,... |
|
|
| Acoustics, Speech and Signal Processing, IEEE Conference on (ICASSP'11), Prague : Tchèque, République (2011) |
|
|
|
|
| An acoustically-motivated spatial prior for under-determined reverberant source separation |
|
|
| Ngoc Q. K. Duong 1Emmanuel Vincent 1 |
|
|
| (2011-02-17) |
|
|
| We consider the task of under-determined reverberant audio source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random vector with full-rank spatial covariance matrix. We introduce an inverse Wishart prior over the covariance matrices, whose mean is given by the theory of statistical room acoustics and whose variance is learned from training data. We then derive an Expectation-Maximization (EM) algorithm to estimate the model parameters in the Maximum A Posteriori (MAP) sense given prior knowledge about the microphone spacing and the source positions. This algorithm provides a principled solution to the well-known permutation problem and achieves better separation performance than other algorithms exploiting the same prior knowledge. |
|
|
|
|
|
|
|
|
|
|
| 1: | METISS (INRIA - IRISA) |
| CNRS : UMR6074 – INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – Université de Rennes 1 | |
|
|
|
|
|
|
|
|
| Domain | : | Statistics/Machine Learning |
|
|
| Under-determined convolutive source separation – Full-rank spatial covariance – Statistical room acoustics – Inverse-Wishart prior |
|
|
| Attached file list to this document: | |||||
|
|
|
| inria-00566868, version 1 | |
| http://hal.inria.fr/inria-00566868 | |
| oai:hal.inria.fr:inria-00566868 | |
| From: Ngoc Duong | |
| Submitted on: Sunday, 20 February 2011 19:57:38 | |
| Updated on: Friday, 30 September 2011 11:58:45 | |