Informed Audio Source Separation Using Linearly Constrained Spatial Filters

Stanislaw Gorlow 1, *, Sylvain Marchand 2
* Corresponding author
Lab-STICC - Laboratoire des sciences et techniques de l'information, de la communication et de la connaissance
Abstract: In this work we readdress the issue of audio source separation in an informed scenario, where certain information about the sound sources is embedded into their mixture as an imperceptible watermark. We describe an improved algorithm that follows the linearly constrained minimum-variance filtering approach in the subband domain in order to obtain perceptually better estimates of the source signals than other published approaches. Just like its predecessor, the algorithm imposes no restrictions on the number of simultaneously active sources, nor on their spectral overlap. Rather, it adapts to a given signal constellation and provides the best possible estimates under the given constraints in linearithmic time. The validity of the approach is demonstrated on a stereo mixture with two levels of sound complexity. Both objective and subjective evaluation show that the proposed algorithm outperforms a reference algorithm by at least one grade. Bearing high perceptual resemblance to the original signals at a fairly tolerable data rate of 10-20 kbps per source, the algorithm therefore seems well suited for active listening applications such as re-mixing or re-spatialization in real time.
Document type: Journal article

Cited literature: 16 references
Contributor: Stanislaw Gorlow
Submitted on: Monday, April 22, 2013 - 4:24:45 PM
Last modified on: Monday, February 25, 2019 - 3:14:11 PM
Document(s) archived on: Tuesday, July 23, 2013 - 2:25:09 AM


Files produced by the author(s)



Stanislaw Gorlow, Sylvain Marchand. Informed Audio Source Separation Using Linearly Constrained Spatial Filters. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2013, 21 (1), pp.3-13. ⟨10.1109/TASL.2012.2208629⟩. ⟨hal-00725428⟩


