Reducing computational and memory cost for cellular phone embedded speech recognition system - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2004

Reducing computational and memory cost for cellular phone embedded speech recognition system

Résumé

This paper is focused on cellular phone embedded speech recognition. We present several methods able to fit speech recognition system requirements to cellular phone resource. The proposed techniques are evaluated on a digit recognition task using both French and English corpora. We investigate particularly three aspects of speech processing: acoustic parameterization, recognition algorithms and acoustic modeling. Several parameterization algorithms (LPCC, MFCC and PLP) are compared to the Linear Predictive Coding (LPC) included in the GSM norm. The MFCC and PLP parameterization algorithms perform significantly better than the other ones. Moreover, feature vector size can be reduced until 6 PLP coefficients allowing to decrease memory and computation resources without a significant loss of performance. In order to achieve good performance with reasonable resource needs, we develop several methods to embed classical HMM-based speech recognition system in cellular phone. We first propose an automatic on-line building of phonetic lexicon which allows a minimal but unlimited lexicon. Then we reduce the HMM model complexity by decreasing the number of (Gaussian) components per state. Finally, we evaluate our propositions by comparing Dynamic Time Warping (DTW) with our HMM system-in the context of cellular phone-for clean conditions. The experiments show that our HMM system outperforms DTW for speaker independent task and allows more practical applications for the cellular-phone user interface.
Fichier non déposé

Dates et versions

hal-01318230 , version 1 (19-05-2016)

Identifiants

Citer

Christophe Lévy, Georges Linarès, Pascal Nocera, Jean-François Bonastre. Reducing computational and memory cost for cellular phone embedded speech recognition system. IEEE International Conference on Acoustics, Speech, and Signal Processing. (ICASSP '04)., May 2004, Montreal, Canada. ⟨10.1109/ICASSP.2004.1327109⟩. ⟨hal-01318230⟩

Collections

UNIV-AVIGNON LIA
119 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More