Conference papers

Text island spotting in large speech databases

Abstract : This paper addresses the problem of using journalist prompts or closed captions to build corpora for training speech recognition systems. Such text documents are generally imperfect transcripts that lack timestamps. We propose a method that combines a driven decoding algorithm with a fast-match process to spot text segments. The method is evaluated both on the French ESTER corpus ([4]) and on a large database of recordings from the Radio Television Belge Francophone (RTBF) associated with real prompts. Results show very good spotting performance: we observed an F-measure of about 98% when spotting the real text islands provided by the RTBF corpus. Moreover, decoding driven by the imperfect transcript islands significantly outperforms the baseline system.
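For readers unfamiliar with the metric quoted above, the following is a minimal sketch of how a balanced F-measure can be computed from spotting counts. It assumes the standard F1 definition (harmonic mean of precision and recall); the abstract does not state which variant or counting unit the authors used, and the function name `f_measure` and the counts below are hypothetical, not taken from the paper.

```python
def f_measure(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Balanced F-measure (F1) from spotting counts.

    Assumption: each count refers to one text island that was correctly
    spotted, wrongly spotted, or missed. The paper may count differently.
    """
    if true_positives == 0:
        # No correct detections: precision and recall are both zero.
        return 0.0
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    # Harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only.
score = f_measure(true_positives=90, false_positives=10, false_negatives=10)
print(f"F-measure: {score:.3f}")
```

A high F-measure requires both few false alarms and few missed islands, which is why it is a common single-number summary for spotting tasks.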
Document type :
Conference papers

Contributor : Bibliothèque Universitaire Déposants Hal-Avignon
Submitted on : Saturday, October 29, 2016 - 12:26:22 PM
Last modification on : Thursday, July 2, 2020 - 9:04:07 PM




  • HAL Id : hal-01318080, version 1



Benjamin Lecouteux, Georges Linarès, Frédéric Beaugendre, Pascal Nocera. Text island spotting in large speech databases. INTERSPEECH, Aug 2007, Antwerp, Belgium. ⟨hal-01318080⟩


