Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution

Abstract : In this paper, we propose a supervised multilayer factorization method designed for harmonic/percussive source separation and drum extraction. Our method decomposes the audio signals in sparse orthogonal components which capture the harmonic content, while the drum is represented by an extension of non negative matrix factorization which is able to exploit time-frequency dictionaries to take into account non stationary drum sounds. The drum dictionaries represent various real drum hits and the decomposition has more physical sense and allows for a better interpretation of the results. Experiments on real music data for a harmonic/percussive source separation task show that our method outperforms other state of the art algorithms. Finally, our method is very robust to non stationary harmonic sources that are usually poorly decomposed by existing methods.
Document type :
Conference papers
Complete list of metadatas

Cited literature [26 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01438851
Contributor : Clément Laroche <>
Submitted on : Wednesday, January 18, 2017 - 10:56:53 AM
Last modification on : Monday, March 25, 2019 - 4:16:07 PM
Long-term archiving on : Wednesday, April 19, 2017 - 1:32:19 PM

File

Template.pdf
Files produced by the author(s)

Identifiers

Citation

Clément Laroche, Hélène Papadopoulos, Matthieu Kowalski, Gaël Richard. Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution. The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), Mar 2017, Nouvelle Orleans, United States. ⟨10.1109/icassp.2017.7952115 ⟩. ⟨hal-01438851⟩

Share

Metrics

Record views

973

Files downloads

944