Person name recognition in ASR outputs using continuous context models - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Person name recognition in ASR outputs using continuous context models

Résumé

The detection and characterization, in audiovisual documents, of speech utterances where person names are pronounced, is an important cue for spoken content analysis. This paper tackles the problematic of retrieving spoken person names in the 1-Best ASR outputs of broadcast TV shows. Our assumption is that a person name is a latent variable produced by the lexical context it appears in. Thereby, a spoken name could be derived from ASR outputs even if it has not been proposed by the speech recognition system. A new context modelling is proposed in order to capture lexical and structural information surrounding a spoken name. The fundamental hypothesis of this study has been validated on broadcast TV documents available in the context of the REPERE challenge.
Fichier non déposé

Dates et versions

hal-01314411 , version 1 (11-05-2016)

Identifiants

Citer

Benjamin Bigot, Grégory Senay, Georges Linarès, Corinne Fredouille, Richard Dufour. Person name recognition in ASR outputs using continuous context models. IEEE International Conference on Acoustics, Speech and Signal Processing , May 2013, Vancouver, Canada. ⟨10.1109/ICASSP.2013.6639318⟩. ⟨hal-01314411⟩

Collections

UNIV-AVIGNON LIA
206 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More