MLLR Techniques for Speaker Recognition

Marc Ferràs; Cheung Chi Leung; Claude Barras; Jean-Luc Gauvain

Conference paper, Year: 2008

Marc Ferràs

Role: Author

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Cheung Chi Leung

Role: Author

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Claude Barras

Role: Author
PersonId : 17217
IdHAL : claude-barras
IdRef : 165065583

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Jean-Luc Gauvain

Role: Author

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Abstract

Maximum-Likelihood Linear Regression (MLLR) and Constrained MLLR (CMLLR) have recently been used for feature extraction in speaker recognition. These systems use (C)MLLR transforms as features that are modeled with Support Vector Machines (SVM). This paper evaluates and compares several of these approaches for the NIST Speaker Recognition task. Single CMLLR and up to 4-phonetic-class MLLR transforms are explored using Gaussian Mixture Models (GMM) and large-vocabulary speech recognition Hidden Markov Models (HMM), using both speaker recognition and speech recognition cepstral front-ends and normalizations. Results are provided for the individual systems as well as in combination with two standard cepstral systems. Relative gains of 3% and 12% were obtained when combining the best performing CMLLR-based and MLLR-based systems, respectively, with two standard cepstral systems.
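As a rough illustration of what "transforms as features" can mean, the sketch below applies an affine MLLR transform to a Gaussian mean (mean' = A mean + b, the standard MLLR mean update) and flattens the transform coefficients into one vector that a classifier such as an SVM could consume. The function names, the row-by-row flattening order, the stacking of per-class transforms, and the toy numbers are assumptions made for illustration; the abstract does not specify how the paper builds its feature vectors, and the SVM step itself is left out.

```python
from typing import List, Sequence, Tuple

Matrix = List[List[float]]
Vector = List[float]


def apply_mllr(A: Matrix, b: Vector, mean: Vector) -> Vector:
    """Adapt one Gaussian mean with an affine transform: mean' = A * mean + b."""
    return [
        sum(a_ij * m_j for a_ij, m_j in zip(row, mean)) + b_i
        for row, b_i in zip(A, b)
    ]


def transform_to_feature(A: Matrix, b: Vector) -> Vector:
    """Flatten [A | b] row by row into a vector of length d * (d + 1).

    The flattening order is an assumption; any fixed order would do as long
    as it is the same for every speaker.
    """
    feature: Vector = []
    for row, b_i in zip(A, b):
        feature.extend(row)
        feature.append(b_i)
    return feature


def stack_class_transforms(transforms: Sequence[Tuple[Matrix, Vector]]) -> Vector:
    """Concatenate the flattened transforms of several phonetic classes."""
    stacked: Vector = []
    for A, b in transforms:
        stacked.extend(transform_to_feature(A, b))
    return stacked


# Toy 2-dimensional example with made-up numbers (not taken from the paper).
A = [[1.0, 0.5], [0.0, 2.0]]
b = [0.1, -0.2]

adapted = apply_mllr(A, b, [1.0, 2.0])                 # adapted mean
feature = transform_to_feature(A, b)                   # one-transform feature
two_class = stack_class_transforms([(A, b), (A, b)])   # two-class feature
```

With d-dimensional cepstral features, one transform yields d * (d + 1) coefficients, so a multi-class setup with up to four transforms would give a vector up to four times that long under this stacking assumption.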

Domains

Computer Science [cs]; Signal and Image Processing [eess.SP]

Main file

od08_023.pdf (200.65 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-01690275

Submitted on: Monday, January 22, 2018, 20:36:51

Last modified on: Saturday, October 7, 2023, 21:36:20

Long-term archiving on: Thursday, May 24, 2018, 09:24:42

Dates and versions

hal-01690275, version 1 (22-01-2018)

Identifiers

HAL Id: hal-01690275, version 1

Cite

Marc Ferràs, Cheung Chi Leung, Claude Barras, Jean-Luc Gauvain. MLLR Techniques for Speaker Recognition. Odyssey 2008: The Speaker and Language Recognition Workshop, Jan 2008, Stellenbosch, South Africa. ⟨hal-01690275⟩

Collections

CNRS LIMSI SORBONNE-UNIVERSITE LISN
