Text island spotting in large speech databases - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

Text island spotting in large speech databases

Résumé

This paper addresses the problem of using journalist prompts or closed captions to build corpora for training speech recognition systems. Generally, these text documents are imperfect transcripts which suffer from the lack of timestamps. We propose a method combining a driven decoding algorithm and a fast-match process allowing to spot text-segments. This method is evaluated both on the French ESTER ([4]) corpus and on a large database composed of records from the Radio Television Belge Francophone (RTBF) associated to real prompts. Results show very good performance in terms of spotting; we observed a F-measure of about 98% on spotting the real text island provided by the RTBF corpus. Moreover, the decoding driven by the imperfect transcript island outperforms significantly the baseline system.
Fichier principal
Vignette du fichier
Text_island_spotting_in_large_speech_databases.pdf (246.92 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01318080 , version 1 (29-10-2016)

Identifiants

  • HAL Id : hal-01318080 , version 1

Citer

Benjamin Lecouteux, Georges Linarès, Frédéric Beaugendre, Pascal Nocera. Text island spotting in large speech databases. INTERSPEECH, Aug 2007, Anvers, Belgium. ⟨hal-01318080⟩

Collections

UNIV-AVIGNON LIA
53 Consultations
52 Téléchargements

Partager

Gmail Facebook X LinkedIn More