On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence

Axel Roebel 1 Jordi Pons Puig Marco Liuni 1 Mathieu Lagrange 2
1 Analyse et synthèse sonores [Paris]
STMS - Sciences et Technologies de la Musique et du Son
2 ADTSI
IRCCyN - Institut de Recherche en Communications et en Cybernétique de Nantes
Abstract : This paper presents an investigation into the detection and classification of drum sounds in polyphonic music and drum loops using non-negative matrix deconvolution (NMD) and the Itakura Saito divergence. The Itakura Saito divergence has recently been proposed as especially appropriate for decomposing audio spectra due to the fact that it is scale invariant, but it has not yet been widely adopted. The article studies new contributions for audio event detection methods using the Itakura Saito divergence that improve efficiency and numerical stability, and simplify the generation of target pattern sets. A new approach for handling background sounds is proposed and moreover, a new detection criteria based on estimating the perceptual presence of the target class sources is introduced. Experimental results obtained for drum detection in polyphonic music and drum soli demonstrate the beneficial effects of the proposed extensions.
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01261256
Contributor : Axel Roebel <>
Submitted on : Monday, January 25, 2016 - 9:39:30 AM
Last modification on : Thursday, March 21, 2019 - 1:04:44 PM

Identifiers

Citation

Axel Roebel, Jordi Pons Puig, Marco Liuni, Mathieu Lagrange. On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence. Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. pp.414 - 418, ⟨10.1109/ICASSP.2015.7178002⟩. ⟨hal-01261256⟩

Share

Metrics

Record views

225