Comparing Prosodic Models for Speaker Recognition

Abstract : Several speaker verification systems based on different kinds of prosodic features have recently been proposed. Most of them have been shown to improve performance when fused at the score level with state-of-the-art cepstral-based systems, but the prosodic modelling algorithms they use have not yet been compared systematically. This motivated us to review the proposed prosodic modelling algorithms and compare them under a common experimental condition. The experiments explore different approaches to the sampling/segmentation of prosodic contours and to the selection of prosodic features. They show that simple prosodic systems with features extracted from fixed-size contour segments, without knowledge of phone/pseudo-syllable level information, still provide a significant performance improvement when fused with a state-of-the-art cepstral-based system. Moreover, some prosodic systems are shown to be complementary to each other: fusing these systems with the cepstral-based system can provide further performance improvement on the speaker verification task.
Document type : Conference paper

Contributor : Claude Barras
Submitted on : Tuesday, January 23, 2018 - 5:16:21 PM
Last modification on : Tuesday, September 17, 2019 - 1:13:03 AM
Long-term archiving on : Thursday, May 24, 2018 - 9:51:07 AM


Publisher files allowed on an open archive


  • HAL Id : hal-01690268, version 1



Cheung-Chi Leung, Marc Ferràs, Claude Barras, Jean-Luc Gauvain. Comparing Prosodic Models for Speaker Recognition. Interspeech 2008, Sep 2008, Brisbane, Australia. ⟨hal-01690268⟩


