Comparing Prosodic Models for Speaker Recognition

Cheung-Chi Leung; Marc Ferràs; Claude Barras; Jean-Luc Gauvain

Conference paper, Year: 2008

Cheung-Chi Leung

Role: Author

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Marc Ferràs

Role: Author

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Claude Barras

Role: Author
PersonId : 17217
IdHAL : claude-barras
IdRef : 165065583

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Jean-Luc Gauvain

Role: Author

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Abstract

Recently, speaker verification systems using different kinds of prosodic features have been proposed. Although it has been shown that most of these speaker verification systems can improve system performance using score-level fusion with state-of-the-art cepstral-based systems, a systematic comparison of the prosodic modelling algorithms used in these prosodic systems has not yet been performed. This motivated us to review the proposed prosodic modelling algorithms and compare them using a common experimental condition. These experiments explored different approaches in the sampling/segmentation of prosodic contours and the selection of prosodic features. They show that simple prosodic systems with features extracted from fixed-size contour segments, without knowledge of phone/pseudo-syllable level information, still provide significant performance improvement when fused with a state-of-the-art cepstral-based system. Moreover, some prosodic systems are shown to be complementary to each other. Fusion of these systems with the cepstral-based system can provide further performance improvement on the speaker verification task.
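As a rough illustration of two ideas named in the abstract, the sketch below extracts simple features from fixed-size segments of a pitch contour and combines two systems' scores by a weighted sum. It is a minimal sketch under stated assumptions, not the authors' method: the abstract does not specify the features, segment length, or fusion rule, so the function names `segment_features` and `fuse_scores`, the segment length of 10 frames, the choice of mean and slope as features, and the fusion weight `w` are all hypothetical.

```python
import numpy as np

def segment_features(f0, seg_len=10):
    """Split a contour into fixed-size segments and return [mean, slope] per segment.

    Assumption: f0 is a 1-D sequence of (log-)pitch values for voiced frames.
    seg_len is a hypothetical segment length in frames.
    """
    f0 = np.asarray(f0, dtype=float)
    n_seg = len(f0) // seg_len                      # drop the incomplete tail segment
    segs = f0[: n_seg * seg_len].reshape(n_seg, seg_len)
    t = np.arange(seg_len) - (seg_len - 1) / 2.0    # centred time axis (sums to zero)
    mean = segs.mean(axis=1)
    slope = segs @ t / (t @ t)                      # least-squares slope of each segment
    return np.column_stack([mean, slope])

def fuse_scores(cepstral, prosodic, w=0.8):
    """Weighted-sum score-level fusion of two systems' trial scores.

    Assumption: w is a hypothetical weight that would normally be tuned
    on development data; the paper's actual fusion rule is not given here.
    """
    cepstral = np.asarray(cepstral, dtype=float)
    prosodic = np.asarray(prosodic, dtype=float)
    return w * cepstral + (1.0 - w) * prosodic
```

No phone or pseudo-syllable boundaries are needed for this kind of segmentation, which is the property the abstract highlights for the simple prosodic systems.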

Domains

Signal and Image Processing [eess.SP]; Computer Science [cs]

Main file

i08_1945.pdf (180.54 KB)

Origin: Publisher files allowed on an open archive

https://hal.science/hal-01690268

Submitted on: Tuesday, 23 January 2018, 17:16:21

Last modified on: Saturday, 7 October 2023, 21:36:20

Long-term archiving on: Thursday, 24 May 2018, 09:51:07

Dates and versions

hal-01690268 , version 1 (23-01-2018)

Identifiers

HAL Id : hal-01690268 , version 1

Cite

Cheung-Chi Leung, Marc Ferràs, Claude Barras, Jean-Luc Gauvain. Comparing Prosodic Models for Speaker Recognition. Interspeech 2008, Sep 2008, Brisbane, Australia. ⟨hal-01690268⟩

Collections

CNRS LIMSI UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE LISN
