Constrained Viterbi decoding for embedded user-customised password speaker recognition

Abstract : Embedded speaker recognition in mobile devices could involve several ergonomic constraints and a limited amount of computing resources. GMM/UBM systems have proved their efficiency in more classical contexts where good accuracy depends on a relatively large quantity of speech data. The proposed GMM/UBM extension addresses the situations with limited resources and takes advantage from the temporal structure of speech by using client-customised utterances harnessed by a Markov model. New temporal information is then used to enhance discrimination with Viterbi decoding increasing the gap between client and impostor scores. Experiments on the MyIdea database are performed when impostors know the client-utterance and also when they do not, highlighting the potential of this new approach. A relative gain up to 64% in terms of EER is achieved when impostors do not know the client utterances and performance is equivalent to the GMM/UBM baseline system in other configurations.
Type de document :
Communication dans un congrès
SAC 10, Mar 2010, Sierre, Switzerland. Symposium on Applied Computing (SAC 10), 2010, 〈10.1145/1774088.1774410〉
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01312781
Contributeur : Bibliothèque Universitaire Déposants Hal-Avignon <>
Soumis le : vendredi 30 novembre 2018 - 14:14:25
Dernière modification le : vendredi 30 novembre 2018 - 14:21:45

Fichier

sac_10 (1).pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Anthony Larcher, Jean-François Bonastre, John S.D. Mason. Constrained Viterbi decoding for embedded user-customised password speaker recognition. SAC 10, Mar 2010, Sierre, Switzerland. Symposium on Applied Computing (SAC 10), 2010, 〈10.1145/1774088.1774410〉. 〈hal-01312781〉

Partager

Métriques

Consultations de la notice

49

Téléchargements de fichiers

7