Embedded Mobile Phone Digit-Recognition

Abstract : Speech recognition applications are known to require a significant amount of memory. However, the targeted context of this work-mobile phone embedded speech recognition system-only authorizes less than 100kB of memory. In order to fit the memory resource, a global codebook of Gaussians is learned to derive state-dependent probability density functions. This strategy aims at storing only the transformation function parameters for each state. In this paper, two upper limits (concerning the acoustic model size) are set to 50kB and 100kB. The proposed approches are evaluated on the French corpus VODIS (digit recognition-recorded into car with or without fan/opened window/radio-with a very low Signal/Noise Ratio). This preliminary study allows to build systems fitting the memory constraint with a DER (Digit Error Rate) around 10.9% (for model less than 100kB) which represents a DER absolute increase less than 1% compared to an HMM-based baseline system respecting the same memory constraint. Despite this increase, performance of both approaches remains comparable since the DER is still in the confident interval.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01318184
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon <>
Submitted on : Thursday, May 19, 2016 - 12:32:23 PM
Last modification on : Saturday, June 15, 2019 - 12:24:17 PM

Links full text

Identifiers

Collections

Citation

Christophe Lévy, Georges Linarès, Jean-François Bonastre. Embedded Mobile Phone Digit-Recognition. 8th International Symposium on DSP and Communication Systems,DSPCS'2005 & 4th Workshop on the Internet, Telecommunications and Signal Processing, WITSP'2005, Dec 2005, Noosa Heads, Australia. ⟨10.1007/978-0-387-45976-9_7⟩. ⟨hal-01318184⟩

Share

Metrics

Record views

73