HAL : inria-00594965, version 2

IEEE Journal of Selected Topics in Signal Processing 5, 6 (2011) 1124-1132
Polyphonic pitch estimation and instrument identification by joint modeling of sustained and attack sounds
Jun Wu 1, Emmanuel Vincent 2, Stanislaw Andrzej Raczynski 1, Takuya Nishimoto 3, Nobutaka Ono 4, Shigeki Sagayama 1
(23/05/2011)

Polyphonic pitch estimation and musical instrument identification are among the most challenging tasks in the field of Music Information Retrieval (MIR). While existing approaches have focused on the modeling of harmonic partials, we design a joint Gaussian mixture model of the harmonic partials and the inharmonic attack of each note. This model encodes the power of each partial over time as well as the spectral envelope of the attack part. We derive an Expectation-Maximization (EM) algorithm to estimate the pitch and the parameters of the notes. We then extract timbre features from both the harmonic and the attack parts via Principal Component Analysis (PCA) over the estimated model parameters. Musical instrument recognition for each estimated note is finally carried out with a Support Vector Machine (SVM) classifier. Experiments conducted on mixtures of isolated notes as well as real-world polyphonic music show higher accuracy than state-of-the-art approaches based on the modeling of harmonic partials only.
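To make the idea of a joint harmonic-plus-attack mixture more concrete, the following is a minimal, single-frame sketch and not the model from the paper. It assumes a known pitch `f0`, narrow Gaussians centred on the harmonic partials, and one broad Gaussian standing in for the inharmonic attack; only the mixture weights are re-estimated by EM. All function names, parameter names and default values are hypothetical and chosen for illustration.

```python
import math


def gaussian(x, mean, std):
    """Value of a normal density with the given mean and standard deviation."""
    return math.exp(-0.5 * ((x - mean) / std) ** 2) / (std * math.sqrt(2.0 * math.pi))


def component_densities(freq, f0, n_partials, partial_std, attack_mean, attack_std):
    """Densities of each mixture component at one frequency (Hz).

    Components 0..n_partials-1 are harmonic partials at n * f0;
    the last component is a single broad Gaussian for the attack (an assumption).
    """
    dens = [gaussian(freq, n * f0, partial_std) for n in range(1, n_partials + 1)]
    dens.append(gaussian(freq, attack_mean, attack_std))
    return dens


def em_weights(freqs, power, f0, n_partials=4, partial_std=10.0,
               attack_mean=2000.0, attack_std=500.0, n_iter=100):
    """Estimate mixture weights from a power spectrum with EM.

    The observed power in each frequency bin acts as a soft count.
    Pitch, means and widths are held fixed; only the weights are updated.
    """
    k = n_partials + 1
    weights = [1.0 / k] * k
    dens = [component_densities(f, f0, n_partials, partial_std,
                                attack_mean, attack_std) for f in freqs]
    for _ in range(n_iter):
        new = [0.0] * k
        for d, p in zip(dens, power):
            mix = sum(w * c for w, c in zip(weights, d))
            if mix <= 0.0:
                continue  # bin not explained by any component
            # E-step: share the bin's power among components by responsibility.
            for j in range(k):
                new[j] += p * weights[j] * d[j] / mix
        # M-step: normalise the accumulated power into new weights.
        total = sum(new)
        weights = [v / total for v in new]
    return weights
```

In this sketch the relative weights of the partials and of the attack component play the role of a crude timbre descriptor for one frame. The approach summarised in the abstract additionally estimates the pitch, tracks the power of each partial over time, and feeds the estimated parameters to PCA and an SVM, none of which is shown here.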
1 :  University of Tokyo
2 :  METISS (INRIA - IRISA)
CNRS : UMR6074 – INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – Université de Rennes 1
3 :  Olarbee Japan
4 :  National Institute of Informatics [Tokyo] (NII)
Computer Science/Signal and Image Processing

Engineering Sciences/Signal and Image Processing
Files attached to this document:
PDF
wu_JSTSP11.pdf(1.1 MB)