Evaluating grapheme-to-phoneme converters in automatic speech recognition context - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Evaluating grapheme-to-phoneme converters in automatic speech recognition context

Denis Jouvet
Dominique Fohr
Irina Illina

Résumé

This paper deals with the evaluation of grapheme-to-phoneme (G2P) converters in a speech recognition context. The precision and recall rates are investigated as potential measures of the quality of the multiple generated pronunciation variants. Very different results are obtained whether or not we take into account the frequency of occurrence of the words. Since G2P systems are rarely evaluated on a speech recognition performance basis, the originality of this paper consists in using a speech recognition system to evaluate the G2P pronunciation variants. The results show that the training process is quite robust to some errors in the pronunciation lexicon, whereas pronunciation lexicon errors are harmful in the decoding process. Noticeable speech recognition performance improvements are achieved by combining two different G2P converters, one based on conditional random fields and the other on joint multigram models, as well as by checking the pronunciation variants of the most frequent words.
Fichier principal
Vignette du fichier
EvalG2PinASRcontext-V1.1.pdf (268.82 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00753364 , version 1 (14-09-2015)

Identifiants

Citer

Denis Jouvet, Dominique Fohr, Irina Illina. Evaluating grapheme-to-phoneme converters in automatic speech recognition context. ICASSP - 2012 - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2012, Kyoto, Japan. pp.4821 - 4824, ⟨10.1109/ICASSP.2012.6288998⟩. ⟨hal-00753364⟩
284 Consultations
890 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More