Decoupling session variability modelling and speaker characterisation

Anthony Larcher; Christophe Lévy; Driss Matrouf; Jean-François Bonastre

Conference paper, Year: 2010

Anthony Larcher

Role: Author
PersonId : 20105
IdHAL : anthony-larcher
ORCID : 0000-0003-4398-0224
IdRef : 139544569

Laboratoire Informatique d'Avignon

Christophe Lévy

Role: Author

Laboratoire Informatique d'Avignon

Driss Matrouf

Role: Author
PersonId : 176307
IdHAL : driss-matrouf
IdRef : 137773439

Laboratoire Informatique d'Avignon

Jean-François Bonastre

Role: Author
PersonId : 172421
IdHAL : jean-francois-bonastre
ORCID : 0000-0001-7741-3346
IdRef : 079112978

Laboratoire Informatique d'Avignon

Abstract

Over the past years, the Factor Analysis (FA) framework has proved highly effective at modelling session variability. However, training the FA parameters requires a large amount of data. When the available database is small, the number of components of the core statistical model, the UBM, must also be kept small, because the UBM determines the dimension of the main FA matrix. Since the size of the UBM directly gives the size of the speaker supervector (the concatenation of the GMM mean parameters), it also limits the intrinsic capacity of the recognition system and therefore the performance that can be expected. This paper aims to remove this limitation by breaking the intrinsic link between the FA dimensionality and the UBM dimensionality. Session variability is modelled in a lower dimension than that of the UBM, and it is the UBM that drives the discriminative power of the system. The first experimental results presented in this paper, obtained within the NIST-SRE 2008 framework, are encouraging: a 512-component UBM combined with 32-component session variability modelling yields a relative EER improvement of about 18% over a 32-component UBM combined with the same variability modelling.
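The dimensionality link described in the abstract can be illustrated with a little arithmetic. The sketch below is not taken from the paper: the feature dimension (50) and the session-subspace rank (40) are hypothetical values chosen only for illustration, and the function names are made up here. It only shows how the supervector length, and with it the size of a session variability matrix defined on that supervector, grows with the number of UBM components.

```python
def supervector_dim(n_components: int, feature_dim: int) -> int:
    """Length of the supervector: all GMM mean vectors concatenated."""
    return n_components * feature_dim


def session_matrix_params(n_components: int, feature_dim: int, rank: int) -> int:
    """Entries of a (supervector_dim x rank) session variability matrix."""
    return supervector_dim(n_components, feature_dim) * rank


# Hypothetical settings, NOT values reported in the paper.
FEATURE_DIM = 50  # assumed acoustic feature dimension
RANK = 40         # assumed rank of the session variability subspace

for n_components in (32, 512):
    dim = supervector_dim(n_components, FEATURE_DIM)
    params = session_matrix_params(n_components, FEATURE_DIM, RANK)
    print(f"{n_components:>3} components: supervector dim = {dim}, "
          f"session matrix entries = {params}")
```

Under these assumed settings, moving from 32 to 512 components multiplies the number of session matrix entries to estimate by 16, which is why a small training database constrains the UBM size when the two dimensionalities are tied together.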

Keywords

speaker verification, GMM, EigenChannel adaptation, session variability

Domains

Computer Science [cs]

Main file

Expanded_FA.pdf (246.35 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-01317698

Submitted on: Monday, 19 November 2018, 10:31:20

Last modified on: Tuesday, 14 January 2020, 10:38:06

Long-term archiving on: Wednesday, 20 February 2019, 13:03:48

Dates and versions

hal-01317698, version 1 (19-11-2018)

Identifiers

HAL Id: hal-01317698, version 1

Cite

Anthony Larcher, Christophe Lévy, Driss Matrouf, Jean-François Bonastre. Decoupling session variability modelling and speaker characterisation. INTERSPEECH, Sep 2010, Makuhari, Japan. ⟨hal-01317698⟩

Collections

UNIV-AVIGNON LIA
