Video-informed acoustic transfer function estimation for enhancing audio source separation through noise source suppression

Abstract : This paper describes a method where an interference noise source within an audio source separation scenario is suppressed from a mixture. The principal idea of the proposed method is to use a video camera array for locating a interference noise source whose 3D position will be used to estimate a matrix of frequency responses (FRs) by linearly combining a series of previously known FRs. A filter is calculated to remove the contribution of the noise source from a convolutive mixture at each microphone, through the estimated FRs. The proposed method is assumed to implemented in a 'block-wise' manner in time domain and has been tested on mixtures created by impulse responses generated by the image method for small room acoustics.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-00931102
Contributor : Bertrand Rivet <>
Submitted on : Tuesday, January 14, 2014 - 8:47:07 PM
Last modification on : Monday, July 8, 2019 - 3:08:55 PM

Identifiers

Citation

Jack Harris, Bertrand Rivet, Naqvi Syed Mohsen, Jonathon Chambers, Christian Jutten. Video-informed acoustic transfer function estimation for enhancing audio source separation through noise source suppression. IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2013), Sep 2013, Southampton, United Kingdom. pp.1-6, ⟨10.1109/MLSP.2013.6661927⟩. ⟨hal-00931102⟩

Share

Metrics

Record views

275