Combined systems for automatic phonetic transcription of proper nouns

Abstract : Large vocabulary automatic speech recognition (ASR) technologies perform well in known, controlled contexts. However recognition of proper nouns is commonly considered as a difficult task. Accurate phonetic transcription of a proper noun is difficult to obtain, although it can be one of the most important resources for a recognition system. In this article, we propose methods of automatic phonetic transcription applied to proper nouns. The methods are based on combinations of the rule-based phonetic transcription generator LIA PHON and an acoustic-phonetic decoding system. On the ESTER corpus, we observed that the combined systems obtain better results than our reference system (LIA PHON). The WER (Word Error Rate) decreased on segments of speech containing proper nouns, without affecting negatively the results on the rest of the corpus. On the same corpus, the Proper Noun Error Rate (PNER, which is a WER computed on proper nouns only), decreased with our new system.
Type de document :
Communication dans un congrès
6th Language Evaluation and Resources Conference (LREC 2008), May 2008, Marrakech, Morocco. LREC 2008 Proceedings, pp.1791-1795, 2008, 〈http://www.lrec-conf.org/lrec2008/〉
Liste complète des métadonnées

Littérature citée [8 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01433960
Contributeur : Sylvain Meignier <>
Soumis le : lundi 27 mars 2017 - 10:20:49
Dernière modification le : jeudi 6 avril 2017 - 10:12:09
Document(s) archivé(s) le : mercredi 28 juin 2017 - 12:30:50

Fichier

455_paper.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : hal-01433960, version 1

Collections

Citation

Antoine Laurent, Teva Merlin, Sylvain Meignier, Yannick Estève, P Deléglise. Combined systems for automatic phonetic transcription of proper nouns. 6th Language Evaluation and Resources Conference (LREC 2008), May 2008, Marrakech, Morocco. LREC 2008 Proceedings, pp.1791-1795, 2008, 〈http://www.lrec-conf.org/lrec2008/〉. 〈hal-01433960〉

Partager

Métriques

Consultations de la notice

147

Téléchargements de fichiers

82