I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION

Abstract : Short speech duration remains a critical factor of performance degradation when deploying a speaker verification system. To overcome this difficulty, a large number of commercial applications impose the use of fixed pass-phrases. In this context , we show that the performance of the popular i-vector approach can be greatly improved by taking advantage of the phonetic information that they convey. Moreover, as i-vectors require a conditioning process to reach high accuracy, we show that further improvements are possible by taking advantage of this phonetic information within the normalisation process. We compare two methods, Within Class Covariance Normalization (WCCN) and Eigen Factor Radial (EFR), both relying on parameters estimated on the same development data. Our study suggests that WCCN is more robust to data mismatch but less efficient than EFR when the development data has a better match with the test data.
Document type :
Conference papers
Complete list of metadatas

Cited literature [10 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01927733
Contributor : Anthony Larcher <>
Submitted on : Tuesday, November 20, 2018 - 9:57:39 AM
Last modification on : Tuesday, July 2, 2019 - 5:38:02 PM

File

3649.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01927733, version 1

Collections

Citation

Anthony Larcher, Pierre-Michel Bousquet, Kong Lee, Driss Matrouf, Haizhou Li, et al.. I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION. IEEE International Conference on Acoustic Speech and Signal Processing, Mar 2012, Kyoto, Japan. ⟨hal-01927733⟩

Share

Metrics

Record views

11

Files downloads

20