A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition

Driss Matrouf
Georges Linares

Résumé

Speech recognition applications are known to require a significant amount of resources (memory, computing power). However , embedded speech recognition systems, such as in mobile phones, only authorizes few KB of memory and few MIPS. In the context of HMM-based speech recognizers, each HMM-state distribution is modeled independently from to the other and has a large amount of parameters. In spite of using state-tying techniques, the size of the acoustic models stays large and certain redundancy remains between states. In this paper, we investigate the capacity of the Subspace Gaussian Mixture approach to reduce the acoustic models size while keeping good performances. We introduce a simplification concerning state specific Gaussians weights estimation, which is a very complex and time consuming procedure in the original approach. With this approach, we show that the acoustic model size can be reduced by 92% with almost the same performance as the standard acoustic modeling. Index Terms— Compact Acoustic Models, Subspace Gaussian Mixture, Embedded speech recognition, Gaussian Mixture Models, Hidden Markov Models
Fichier non déposé

Dates et versions

hal-01313109 , version 1 (09-05-2016)

Identifiants

Citer

Mohamed Bouallegue, Driss Matrouf, Georges Linares. A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2011, Prague, Czech Republic. ⟨10.1109/ICASSP.2011.5947453⟩. ⟨hal-01313109⟩

Collections

UNIV-AVIGNON LIA
50 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More