Exploring temporal context in diachronic text documents for automatic OOV proper name retrieval - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Exploring temporal context in diachronic text documents for automatic OOV proper name retrieval

Résumé

Proper name recognition is a challenging task in information retrieval in large audio/video databases. Proper names are semantically rich and are usually key to understanding the information contained in a document. Our work focuses on increasing the vocabulary coverage of a speech transcription system by automatically retrieving proper names from contemporary diachronic text documents. We proposed methods that dynamically augment the automatic speech recognition system vocabulary, using lexical and temporal features in diachronic documents. We also studied different metrics for proper name selection in order to limit the vocabulary augmentation and therefore the impact on the ASR performances. Recognition results show a significant reduction of the word error rate using augmented vocabulary.
Fichier non déposé

Dates et versions

hal-00924696 , version 1 (07-01-2014)

Identifiants

  • HAL Id : hal-00924696 , version 1

Citer

Imane Nkairi, Irina Illina, Georges Linarès, Dominique Fohr. Exploring temporal context in diachronic text documents for automatic OOV proper name retrieval. Language & Technology Conference, Dec 2013, Poznań, Poland. pp.540-544. ⟨hal-00924696⟩
199 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More