Person name recognition in ASR outputs using continuous context models

Benjamin Bigot; Grégory Senay; Georges Linarès; Corinne Fredouille; Richard Dufour

doi:10.1109/ICASSP.2013.6639318

Communication Dans Un Congrès Année : 2013

Person name recognition in ASR outputs using continuous context models

(1) , (1) , (1) , (1) , (1)

Benjamin Bigot

Fonction : Auteur

Laboratoire Informatique d'Avignon

Grégory Senay

Fonction : Auteur

Laboratoire Informatique d'Avignon

Georges Linarès

Fonction : Auteur
PersonId : 4977
IdHAL : georges-linares
IdRef : 079368794

Laboratoire Informatique d'Avignon

Corinne Fredouille

Fonction : Auteur
PersonId : 173870
IdHAL : corinne-fredouille
ORCID : 0000-0002-0413-8950
IdRef : 079420516

Laboratoire Informatique d'Avignon

Richard Dufour

Fonction : Auteur
PersonId : 178348
IdHAL : richard-dufour
ORCID : 0000-0003-1203-9108

Laboratoire Informatique d'Avignon

Résumé

The detection and characterization, in audiovisual documents, of speech utterances where person names are pronounced, is an important cue for spoken content analysis. This paper tackles the problematic of retrieving spoken person names in the 1-Best ASR outputs of broadcast TV shows. Our assumption is that a person name is a latent variable produced by the lexical context it appears in. Thereby, a spoken name could be derived from ASR outputs even if it has not been proposed by the speech recognition system. A new context modelling is proposed in order to capture lexical and structural information surrounding a spoken name. The fundamental hypothesis of this study has been validated on broadcast TV documents available in the context of the REPERE challenge.

Mots clés

Index Terms— spoken document retrieval spoken name detection lexical context representation Index Terms— spoken document retrieval Index Terms— spoken document retrieval Index Terms— spoken document retrieval

Domaines

Informatique [cs]

bibliothèque Universitaire Déposants HAL-Avignon : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01314411

Soumis le : mercredi 11 mai 2016-14:00:34

Dernière modification le : mardi 22 mars 2022-14:40:01

Dates et versions

hal-01314411 , version 1 (11-05-2016)

Identifiants

HAL Id : hal-01314411 , version 1
DOI : 10.1109/ICASSP.2013.6639318

Citer

Benjamin Bigot, Grégory Senay, Georges Linarès, Corinne Fredouille, Richard Dufour. Person name recognition in ASR outputs using continuous context models. IEEE International Conference on Acoustics, Speech and Signal Processing , May 2013, Vancouver, Canada. ⟨10.1109/ICASSP.2013.6639318⟩. ⟨hal-01314411⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON LIA

206 Consultations

0 Téléchargements

Person name recognition in ASR outputs using continuous context models

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager