A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition

Mohamed Bouallegue; Driss Matrouf; Georges Linares

doi:10.1109/ICASSP.2011.5947453

Communication Dans Un Congrès Année : 2011

A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition

(1) , (1) , (1)

Mohamed Bouallegue

Fonction : Auteur correspondant
PersonId : 981952

Connectez-vous pour contacter l'auteur

Laboratoire Informatique d'Avignon

Driss Matrouf

Fonction : Auteur
PersonId : 176307
IdHAL : driss-matrouf
IdRef : 137773439

Laboratoire Informatique d'Avignon

Georges Linares

Fonction : Auteur
PersonId : 4977
IdHAL : georges-linares
IdRef : 079368794

Laboratoire Informatique d'Avignon

Résumé

Speech recognition applications are known to require a significant amount of resources (memory, computing power). However , embedded speech recognition systems, such as in mobile phones, only authorizes few KB of memory and few MIPS. In the context of HMM-based speech recognizers, each HMM-state distribution is modeled independently from to the other and has a large amount of parameters. In spite of using state-tying techniques, the size of the acoustic models stays large and certain redundancy remains between states. In this paper, we investigate the capacity of the Subspace Gaussian Mixture approach to reduce the acoustic models size while keeping good performances. We introduce a simplification concerning state specific Gaussians weights estimation, which is a very complex and time consuming procedure in the original approach. With this approach, we show that the acoustic model size can be reduced by 92% with almost the same performance as the standard acoustic modeling. Index Terms— Compact Acoustic Models, Subspace Gaussian Mixture, Embedded speech recognition, Gaussian Mixture Models, Hidden Markov Models

Domaines

Informatique [cs]

bibliothèque Universitaire Déposants HAL-Avignon : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01313109

Soumis le : lundi 9 mai 2016-15:38:16

Dernière modification le : mardi 14 janvier 2020-10:38:06

Dates et versions

hal-01313109 , version 1 (09-05-2016)

Identifiants

HAL Id : hal-01313109 , version 1
DOI : 10.1109/ICASSP.2011.5947453

Citer

Mohamed Bouallegue, Driss Matrouf, Georges Linares. A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2011, Prague, Czech Republic. ⟨10.1109/ICASSP.2011.5947453⟩. ⟨hal-01313109⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON LIA

50 Consultations

0 Téléchargements

A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager