Comparing Prosodic Models for Speaker Recognition - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Comparing Prosodic Models for Speaker Recognition

Résumé

Recently, speaker verification systems using different kinds of prosodic features have been proposed. Although it has been shown that most of these speaker verification systems can improve system performance using score-level fusion with state-of-the-art cepstral-based systems, a systematic comparison of the prosodic modelling algorithms used in these prosodic systems has not yet been performed. This motivated us to review the proposed prosodic modelling algorithms and compare them using a common experimental condition. These experiments explored different approaches in the sampling/segmentation of prosodic contours and the selection of prosodic features. They show that simple prosodic systems with features extracted from fixed-size contour segments, without knowledge of phone/pseudo-syllable level information, still provide significant performance improvement when fused with a state-of-the-art cepstral-based system. Moreover, some prosodic systems are shown to be complementary to each other. Fusion of these systems with the cepstral-based system can provide further performance improvement on the speaker verification task.
Fichier principal
Vignette du fichier
i08_1945.pdf (180.54 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01690268 , version 1 (23-01-2018)

Identifiants

  • HAL Id : hal-01690268 , version 1

Citer

Cheung-Chi Leung, Marc Ferràs, Claude Barras, Jean-Luc Gauvain. Comparing Prosodic Models for Speaker Recognition. Interspeech 2008, Sep 2008, Brisbane, Australia. ⟨hal-01690268⟩
27 Consultations
35 Téléchargements

Partager

Gmail Facebook X LinkedIn More