Imperfect transcript driven speech recognition - HAL open archive
Conference paper, Year: 2006

Imperfect transcript driven speech recognition

Abstract

In many cases, textual information can be associated with speech signals: movie subtitles, theater scripts, broadcast news summaries, etc. This information can be regarded as an approximate transcript, and it rarely corresponds to the exact words uttered. The goal of this work is to use this kind of information to improve the performance of an automatic speech recognition (ASR) system. Multiple applications are possible: following a play with closed captions aligned to the voice signal (while allowing for performer variations) to help deaf people, watching a movie in another language with aligned and corrected closed captions, etc. We propose in this paper a method combining a linguistic analysis of the imperfect transcripts and a dynamic synchronization of these transcripts inside the search algorithm. The proposed technique is based on language model adaptation and on-line synchronization of the search algorithm. Experiments are carried out on an extract of the ESTER evaluation campaign [4] database, using the LIA Broadcast News system. The results show that the transcript-driven system significantly outperforms both the original recognizer and the imperfect transcript itself.
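The abstract does not say how the language model is adapted. As one illustrative possibility, not taken from the paper, the sketch below linearly interpolates a general unigram model with a unigram model estimated from the imperfect transcript, so that words present in the transcript gain probability mass. The function names, the toy sentences, and the interpolation weight are all hypothetical.

```python
from collections import Counter


def unigram_probs(tokens):
    """Maximum-likelihood unigram probabilities for a list of tokens."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def interpolate(general, transcript, weight):
    """Linearly mix two unigram models.

    `weight` is the share given to the transcript model (an assumed
    tuning parameter, not a value reported in the paper).
    """
    vocab = set(general) | set(transcript)
    return {
        word: (1.0 - weight) * general.get(word, 0.0)
        + weight * transcript.get(word, 0.0)
        for word in vocab
    }


# Toy data standing in for a general corpus and an imperfect transcript.
general_lm = unigram_probs(
    "the news tonight covers the election and the weather".split()
)
transcript_lm = unigram_probs(
    "the minister comments on the election results".split()
)
adapted_lm = interpolate(general_lm, transcript_lm, weight=0.5)
```

With these toy inputs, a word that appears only in the transcript (such as "minister") receives non-zero probability in `adapted_lm`, and the adapted distribution still sums to one. A real system would work with higher-order n-grams and smoothing; this sketch only shows the direction of the bias.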
Main file
10.1.1.619.7409.pdf (99.43 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01318085, version 1 (09-11-2017)

Identifiers

  • HAL Id: hal-01318085, version 1

Cite

Benjamin Lecouteux, Georges Linarès, Pascal Nocera, Jean-François Bonastre. Imperfect transcript driven speech recognition. INTERSPEECH, Sep 2006, Pittsburgh, United States. ⟨hal-01318085⟩

Collections

UNIV-AVIGNON LIA
