The LIUM speech transcription system: a CMU Sphinx III-based system for french broadcast news

Abstract : This paper presents the system used by the LIUM to participate in ESTER, the French broadcast news evaluation campaign. This system is based on the CMU Sphinx 3.3 (fast) decoder. Several tools are presented that were added at different steps of the Sphinx recognition process: segmentation, acoustic model adaptation, and word-lattice rescoring. Experiments were conducted to study the effects of signal segmentation on the recognition process, of injecting automatically transcribed data into the training corpora, and of different approaches to acoustic model adaptation. The results are presented in this paper. With very few modifications and a simple MAP acoustic model estimation, the Sphinx 3.3 decoder reached a word error rate of 28.2%. The entire system developed by LIUM obtained an official word error rate of 23.6% in the ESTER evaluation, and an unsubmitted system reached 23.4%.
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-01434282
Contributor : Sylvain Meignier
Submitted on : Wednesday, March 22, 2017 - 3:16:15 PM
Last modification on : Thursday, April 6, 2017 - 10:13:39 AM
Document(s) archived on : Friday, June 23, 2017 - 1:19:39 PM

Identifiers

  • HAL Id : hal-01434282, version 1

Citation

Paul Deléglise, Yannick Estève, Sylvain Meignier, Teva Merlin. The LIUM speech transcription system: a CMU Sphinx III-based system for french broadcast news. 9th European Conference on Speech Communication and Technology (Interspeech 2005), Sep 2005, Lisbonne, Portugal. ⟨hal-01434282⟩
