Dynamic adjustment of language models for automatic speech recognition using word similarity

Anna Currey; Irina Illina; Dominique Fohr

Communication Dans Un Congrès Année : 2016

Dynamic adjustment of language models for automatic speech recognition using word similarity

(1) , (1) , (1)

Anna Currey

Fonction : Auteur

Speech Modeling for Facilitating Oral-Based Communication

Irina Illina

Fonction : Auteur
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Speech Modeling for Facilitating Oral-Based Communication

Dominique Fohr

Fonction : Auteur
PersonId : 15652
IdHAL : dominique-fohr
IdRef : 031092942

Speech Modeling for Facilitating Oral-Based Communication

Résumé

Out-of-vocabulary (OOV) words can pose a particular problem for automatic speech recognition (ASR) of broadcast news. The language models (LMs) of ASR systems are typically trained on static corpora, whereas new words (particularly new proper nouns) are continually introduced in the media. Additionally, such OOVs are often content-rich proper nouns that are vital to understanding the topic. In this work, we explore methods for dynamically adding OOVs to language models by adapting the n-gram language model used in our ASR system. We propose two strategies: the first relies on finding in-vocabulary (IV) words similar to the OOVs, where word embeddings are used to define similarity. Our second strategy leverages a small contemporary corpus to estimate OOV probabilities. The models we propose yield improvements in perplexity over the baseline; in addition, the corpus-based approach leads to a significant decrease in proper noun error rate over the baseline in recognition experiments.

Mots clés

ASR language modeling OOV word embeddings lexicon extension

Domaines

Interface homme-machine [cs.HC] Informatique et langage [cs.CL]

Fichier principal

slt-paper (6).pdf (175.19 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Dominique Fohr : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01384365

Soumis le : mercredi 19 octobre 2016-17:22:58

Dernière modification le : lundi 11 septembre 2023-17:41:19

Dates et versions

hal-01384365 , version 1 (19-10-2016)

Identifiants

HAL Id : hal-01384365 , version 1

Citer

Anna Currey, Irina Illina, Dominique Fohr. Dynamic adjustment of language models for automatic speech recognition using word similarity . IEEE Workshop on Spoken Language Technology (SLT 2016), Dec 2016, San Diego, CA, United States. ⟨hal-01384365⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD ANR

515 Consultations

877 Téléchargements

Dynamic adjustment of language models for automatic speech recognition using word similarity

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager