A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition

Abstract : Speech recognition applications are known to require a significant amount of resources (memory, computing power). However, embedded speech recognition systems, such as those in mobile phones, allow only a few KB of memory and a few MIPS. In HMM-based speech recognizers, each HMM-state distribution is modeled independently of the others and has a large number of parameters. Even with state-tying techniques, the acoustic models remain large and some redundancy persists between states. In this paper, we investigate the ability of the Subspace Gaussian Mixture approach to reduce acoustic model size while maintaining good performance. We introduce a simplification of the estimation of state-specific Gaussian weights, which is a very complex and time-consuming procedure in the original approach. With this approach, we show that the acoustic model size can be reduced by 92% with almost the same performance as standard acoustic modeling.
Index Terms : Compact Acoustic Models, Subspace Gaussian Mixture, Embedded speech recognition, Gaussian Mixture Models, Hidden Markov Models
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-01313109
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon
Submitted on : Monday, May 9, 2016 - 3:38:16 PM
Last modification on : Tuesday, July 2, 2019 - 5:38:02 PM

Citation

Mohamed Bouallegue, Driss Matrouf, Georges Linares. A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2011, Prague, Czech Republic. ⟨10.1109/ICASSP.2011.5947453⟩. ⟨hal-01313109⟩
