Skip to Main content Skip to Navigation
Conference papers

Imperfect transcript driven speech recognition

Abstract : In many cases, textual information can be associated with speech signals such as movie subtitles, theater scenarios, broadcast news summaries etc. This information could be considered as approximated transcripts and corresponds rarely to the exact word utterances. The goal of this work is to use this kind of information to improve the performance of an automatic speech recognition (ASR) system. Multiple applications are possible: to follow a play with closed caption aligned to the voice signal (while respecting to performer variations) to help deaf people, to watch a movie in another language using aligned and corrected closed captions, etc. We propose in this paper a method combining a linguistic analysis of the imperfect transcripts and a dynamic synchronization of these transcripts inside the search algorithm. The proposed technique is based on language model adaptation and on-line synchronization of the search algorithm. Experiments are carried out on an extract of the ESTER evaluation campaign [4] database, using the LIA Broadcast News system. The results show that the transcript-driven system outperforms significantly both the original recognizer and the imperfect transcript itself.
Document type :
Conference papers
Complete list of metadata

Cited literature [10 references]  Display  Hide  Download
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon Connect in order to contact the contributor
Submitted on : Thursday, November 9, 2017 - 9:26:04 AM
Last modification on : Thursday, July 2, 2020 - 9:04:07 PM
Long-term archiving on: : Saturday, February 10, 2018 - 12:19:59 PM

Files produced by the author(s)


  • HAL Id : hal-01318085, version 1



Benjamin Lecouteux, Georges Linarès, Pascal Nocera, Jean-François Bonastre. Imperfect transcript driven speech recognition. INTERSPEECH, Sep 2006, Pittsburgh, United States. ⟨hal-01318085⟩



Les métriques sont temporairement indisponibles