The LIUM speech transcription system: a CMU Sphinx III-based system for French broadcast news

Abstract: This paper presents the system used by the LIUM to participate in ESTER, the French broadcast news evaluation campaign. The system is based on the CMU Sphinx 3.3 (fast) decoder. Some tools are presented that have been added at different steps of the Sphinx recognition process: segmentation, acoustic model adaptation, and word-lattice rescoring. Several experiments were conducted to study the effects of signal segmentation on the recognition process, to inject automatically transcribed data into the training corpora, and to test different approaches to acoustic model adaptation. The results are presented in this paper. With very few modifications and a simple MAP acoustic model estimation, the Sphinx 3.3 decoder reached a word error rate of 28.2%. The entire system developed by LIUM obtained an official word error rate of 23.6% in the ESTER evaluation, and an unsubmitted system reached 23.4%.
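The results above are all stated as word error rates. For readers unfamiliar with the metric, here is a minimal sketch of how it is commonly computed: the word-level edit distance (substitutions, deletions, insertions) between a reference and a hypothesis transcript, divided by the number of reference words. This is not the ESTER scoring tool, and it skips the text normalization a real evaluation would apply. The function name and the example sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (initially none) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: two substituted words out of five reference words.
wer = word_error_rate("le journal de vingt heures", "le journal du vingt heure")
print(f"{wer:.1%}")
```

Because insertions count as errors, this ratio can exceed 100% when the hypothesis is much longer than the reference.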
Document type:
Conference paper
9th European Conference on Speech Communication and Technology (Interspeech 2005), Sep 2005, Lisbon, Portugal. Interspeech'2005 - Eurospeech 9th European Conference on Speech Communication and Technology, 2005

https://hal.archives-ouvertes.fr/hal-01434282
Contributor: Sylvain Meignier
Submitted on: Wednesday, 22 March 2017 - 15:16:15
Last modified on: Thursday, 6 April 2017 - 10:13:39
Document(s) archived on: Friday, 23 June 2017 - 13:19:39

File

f55ad1cbbbab8fcb3db57c4f304916...
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01434282, version 1

Citation

Paul Deléglise, Yannick Estève, Sylvain Meignier, Teva Merlin. The LIUM speech transcription system: a CMU Sphinx III-based system for French broadcast news. 9th European Conference on Speech Communication and Technology (Interspeech 2005), Sep 2005, Lisbon, Portugal. Interspeech'2005 - Eurospeech 9th European Conference on Speech Communication and Technology, 2005. 〈hal-01434282〉
