Skip to Main content Skip to Navigation
Book sections


Abstract : Speech recognition applications are known to require substantial amount of resources in terms of training data, memory and computing power. However, the targeted context of this work-embedded mobile phone speech recognition systems-only authorizes few KB of memory, few MIPS and usually a small amount of training data. In order to meet the resource constraints, an approach based on an HMM system using a GMM-based state-independent acoustic modeling is proposed in this paper. A transformation is computed and applied to the global GMM in order to obtain each of the HMM state-dependent probability density functions. This strategy aims at storing only the transformation function parameters for each state and enables to decrease the amount of computing power needed for the likelihood computation. The proposed approach is evaluated with a digit recognition task using the French corpus BDSON. Our method allows a Digit Error Rate (DER) of 2.1 %, when the system respects the resource constraints. Compared to a standard HMM with comparable resources, our approach achieved a relative DER decrease of about 52%.
Document type :
Book sections
Complete list of metadatas
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon <>
Submitted on : Thursday, May 19, 2016 - 1:51:48 PM
Last modification on : Tuesday, January 14, 2020 - 10:38:06 AM


  • HAL Id : hal-01318237, version 1



Christophe Levy, Georges Linares, Pascal Nocera, Jean-François Bonastre. Chapter 7 EMBEDDED MOBILE PHONE DIGIT-RECOGNITION. Advances for In-Vehicle and Mobile Systems, 2007. ⟨hal-01318237⟩



Record views