Text island spotting in large speech databases

Benjamin Lecouteux; Georges Linarès; Frédéric Beaugendre; Pascal Nocera

Conference paper, Year: 2007


Benjamin Lecouteux

Role: Author
PersonId : 7847
IdHAL : benjamin-lecouteux
ORCID : 0000-0003-3000-6190
IdRef : 135355060

Laboratoire Informatique d'Avignon

Georges Linarès

Role: Author
PersonId : 4977
IdHAL : georges-linares
IdRef : 079368794

Laboratoire Informatique d'Avignon

Frédéric Beaugendre

Role: Author

Pascal Nocera

Role: Author

Laboratoire Informatique d'Avignon

Abstract

This paper addresses the problem of using journalist prompts or closed captions to build corpora for training speech recognition systems. These text documents are generally imperfect transcripts that lack timestamps. We propose a method that combines a driven decoding algorithm with a fast-match process to spot text segments. The method is evaluated both on the French ESTER corpus [4] and on a large database of recordings from the Radio Television Belge Francophone (RTBF) associated with real prompts. Results show very good spotting performance: we observed an F-measure of about 98% when spotting the real text islands provided by the RTBF corpus. Moreover, decoding driven by the imperfect transcript islands significantly outperforms the baseline system.
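To make the reported metric concrete, here is a minimal sketch of how an F-measure for segment spotting could be computed. It assumes the balanced F-measure (F1), which the abstract does not specify, and it assumes each segment is identified by a hashable key. The function names `f_measure` and `score_spotting` and the counts in the usage note are hypothetical and are not taken from the paper.

```python
def f_measure(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Balanced F-measure (F1): harmonic mean of precision and recall.

    Assumption: the balanced form is used; the abstract does not say which
    variant of the F-measure was reported.
    """
    if true_positives == 0:
        # No correct detections: precision and recall are both zero or undefined.
        return 0.0
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    return 2 * precision * recall / (precision + recall)


def score_spotting(reference: set, spotted: set) -> float:
    """Score spotted segments against reference segments.

    Each segment is represented by any hashable identifier (hypothetical
    representation chosen for illustration).
    """
    tp = len(reference & spotted)   # segments correctly spotted
    fp = len(spotted - reference)   # spotted segments that are not in the reference
    fn = len(reference - spotted)   # reference segments that were missed
    return f_measure(tp, fp, fn)
```

As an illustration with made-up counts, 98 correct detections with 2 false alarms and 2 misses give a precision and recall of 0.98 each, hence an F-measure of 0.98.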

Keywords

speech recognition; closed captioning; corpus building

Domains

Computer Science [cs]

Main file

Text_island_spotting_in_large_speech_databases.pdf (246.92 KB)

Origin: Files produced by the author(s)

Submitted by: bibliothèque Universitaire Déposants HAL-Avignon

https://hal.science/hal-01318080

Submitted on: Saturday, October 29, 2016, 12:26:22

Last modified on: Tuesday, March 22, 2022, 14:40:01

Dates and versions

hal-01318080 , version 1 (29-10-2016)

Identifiers

HAL Id : hal-01318080 , version 1

Cite

Benjamin Lecouteux, Georges Linarès, Frédéric Beaugendre, Pascal Nocera. Text island spotting in large speech databases. INTERSPEECH, Aug 2007, Antwerp, Belgium. ⟨hal-01318080⟩


Collections

UNIV-AVIGNON LIA

53 views

52 downloads
