Sparse and structured decomposition of audio signals on hybrid dictionaries using musical priors

Abstract : This paper investigates the use of musical priors for sparse expansion of audio signals of music, on an overcomplete dual-resolution dictionary taken from the union of two orthonormal bases that can describe both transient and tonal components of a music audio signal. More specifically, chord and metrical structure information are used to build a structured model that takes into account dependencies between coefficients of the decomposition, both for the tonal and for the transient layer. The denoising task application is used to provide a proof of concept of the proposed musical priors. Several configurations of the model are analyzed. Evaluation on monophonic and complex polyphonic excerpts of real music signals shows that the proposed approach provides results whose quality measured by the signal-to-noise ratio is competitive with state-of-the-art approaches, and more coherent with the semantic content of the signal. A detailed analysis of the model in terms of sparsity and in terms of interpretability of the representation is also provided, and shows that the model is capable of giving a relevant and legible representation of Western tonal music audio signals.
Liste complète des métadonnées

Cited literature [25 references]  Display  Hide  Download
Contributor : I Papadopoulos <>
Submitted on : Sunday, October 6, 2013 - 5:31:28 PM
Last modification on : Saturday, September 8, 2018 - 4:24:02 PM
Document(s) archivé(s) le : Thursday, April 6, 2017 - 8:52:43 PM


Files produced by the author(s)




Hélène Papadopoulos, Matthieu Kowalski. Sparse and structured decomposition of audio signals on hybrid dictionaries using musical priors. Journal of the Acoustical Society of America, Acoustical Society of America, 2013, 134 (1), pp.666-685. ⟨10.1121/1.4807821⟩. ⟨hal-00823059v2⟩



Record views


Files downloads