Visual Based Reference for Enhanced Audio-Video Source Extraction

Jack Harris 1, 2 Naqvi Syed Mohsen 3 Bertrand Rivet 1 Jonathon Chambers 2 Christian Jutten 1
GIPSA-DIS - Département Images et Signal
3 Advanced Signal Processing Group
School of Electronic, Electrical and Systems Engineering
Abstract : This paper addresses the problem of source extraction in a complex scene where only moving audio sources are present. An algorithm using a unique yet simple method avoiding higher-order statistics has been developed. The principle idea of the algorithm is to use a video camera array for locating a moving source whose position is used to isolate a noise reference, and thus allowing noise subtraction from the mixture based on the widely-known Widrow adaptive filtering method, that only uses second-order statistics. This adaptive approach provides an alternative to traditional methods particularly when there is need for a real time implementation.
Complete list of metadatas

Cited literature [10 references]  Display  Hide  Download
Contributor : Jack Harris <>
Submitted on : Monday, January 28, 2013 - 1:45:35 PM
Last modification on : Monday, July 8, 2019 - 3:08:54 PM
Long-term archiving on : Monday, June 17, 2013 - 4:12:44 PM


Files produced by the author(s)


  • HAL Id : hal-00781793, version 1


Jack Harris, Naqvi Syed Mohsen, Bertrand Rivet, Jonathon Chambers, Christian Jutten. Visual Based Reference for Enhanced Audio-Video Source Extraction. 9th IMA International Conference on Mathematics in Signal Processing, Jan 2012, Birmingham, United Kingdom. pp.n/c. ⟨hal-00781793⟩



Record views


Files downloads