An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function

Xiaofei Li 1 Laurent Girin 2, 1 Radu Horaud 1
1 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
2 GIPSA-CRISSP - CRISSP
GIPSA-DPC - Département Parole et Cognition
Abstract : This paper addresses the problem of audio source separation from (possibly under-determined) multichannel convolutive mixtures. We propose a separation method based on the convolutive transfer function (CTF) in the short-time Fourier transform domain. For strongly reverberant signals, the CTF is a much more appropriate model than the widely-used multiplicative transfer function approximation. An Expectation-Maximization (EM) algorithm is proposed to jointly estimate the model parameters, including the CTF coefficients of the mixing filters, and infer the sources. Experiments show that the proposed method provides very satisfactory performance on highly reverberant speech mixtures.
Type de document :
Communication dans un congrès
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States
Liste complète des métadonnées


https://hal.inria.fr/hal-01568818
Contributeur : Team Perception <>
Soumis le : mardi 25 juillet 2017 - 18:12:12
Dernière modification le : vendredi 28 juillet 2017 - 17:13:07

Fichier

Xiaofei_WASPAA_2017.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01568818, version 1

Citation

Xiaofei Li, Laurent Girin, Radu Horaud. An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States. <hal-01568818>

Partager

Métriques

Consultations de
la notice

172

Téléchargements du document

47