Speech/music discrimination based on wavelets for broadcast programs - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2006

Speech/music discrimination based on wavelets for broadcast programs

Irina Illina
Odile Mella
Dominique Fohr

Résumé

The problem of speech/music discrimination is a challenging research problem which significantly impacts Automatic Speech Recognition (ASR) performance. This paper proposes new features for the Speech/Music discrimination task. We propose to use a decomposition of the audio signal based on wavelets, which allows a good analysis of non stationary signal like speech or music. We compute different energy types in each frequency band obtained from wavelet decomposition. Two class/non-class classifiers are used : one for speech/non-speech, one for music/non-music. On the broadcast test corpus, the proposed wavelet approach gives better results than the MFCC one. For instance, we have a significant relative improvements of the error rate of 39% for the speech/music discrimination task.
Fichier non déposé

Dates et versions

hal-00103554 , version 1 (04-10-2006)

Identifiants

  • HAL Id : hal-00103554 , version 1

Citer

Emmanuel Didiot, Irina Illina, Odile Mella, Dominique Fohr, Jean-Paul Haton. Speech/music discrimination based on wavelets for broadcast programs. 2006, pp.151. ⟨hal-00103554⟩
116 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More