MLLR Techniques for Speaker Recognition

Abstract : Maximum-Likelihood Linear Regression (MLLR) and Constrained MLLR (CMLLR) have been recently used for feature extraction in speaker recognition. These systems use (C)MLLR transforms as features that are modeled with Support Vector Machines (SVM). This paper evaluates and compares several of these approaches for the NIST Speaker Recognition task. Single CMLLR and up to 4-phonetic-class MLLR transforms are explored using Gaussian Mixture Models (GMM) and large-vocabulary speech recognition Hidden Markov Models (HMM), using both speaker recognition and speech recognition cepstral front-ends and normalizations. Results for the individual systems as well as in combination with two standard cep-stral systems are provided. Relative gains of 3% and 12% were obtained when combining the best performing CMLLR-based and MLLR-based systems with two standard cepstral systems, respectively.
Document type :
Conference papers
Complete list of metadatas

Cited literature [13 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01690275
Contributor : Claude Barras <>
Submitted on : Monday, January 22, 2018 - 8:36:51 PM
Last modification on : Monday, September 16, 2019 - 11:45:23 AM
Long-term archiving on : Thursday, May 24, 2018 - 9:24:42 AM

File

od08_023.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01690275, version 1

Collections

Citation

Marc Ferràs, Cheung Leung, Claude Barras, Jean-Luc Gauvain. MLLR Techniques for Speaker Recognition. Odyssey 2008: The Speaker and Language Recognition Workshop, Jan 2008, Stellenbosch, South Africa. ⟨hal-01690275⟩

Share

Metrics

Record views

40

Files downloads

16