Dynamic Extension of ASR Lexicon Using Wikipedia Data

Badr Abdullah; Irina Illina; Dominique Fohr

Conference paper, Year: 2018


Badr Abdullah

Role: Author

Speech Modeling for Facilitating Oral-Based Communication

Irina Illina

Role: Author
PersonId: 15663
IdHAL: irina-illina
IdRef: 120731746

Speech Modeling for Facilitating Oral-Based Communication

Dominique Fohr

Role: Author
PersonId: 15652
IdHAL: dominique-fohr
IdRef: 031092942

Speech Modeling for Facilitating Oral-Based Communication

Abstract

Despite recent progress in Large Vocabulary Continuous Speech Recognition (LVCSR) systems, these systems still suffer from Out-Of-Vocabulary (OOV) words. In many cases, the OOV words are Proper Nouns (PNs). Correct recognition of PNs is essential for applications such as broadcast news and audio indexing. In this article, we address the problem of OOV PN retrieval in the framework of broadcast news LVCSR. We focus on dynamic (document-dependent) extension of the LVCSR lexicon. To retrieve relevant OOV PNs, we propose to use a very large multipurpose text corpus: Wikipedia. This corpus contains a huge number of PNs, which are grouped into semantically similar classes using word embeddings. We use a two-step approach: first, we select the classes pertinent to the OOV PNs with a multi-class Deep Neural Network (DNN); second, we rank the OOVs of the selected classes. Experiments on French broadcast news show that the Bi-GRU model outperforms the other models studied. Speech recognition experiments demonstrate the effectiveness of the proposed methodology.
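The two-step approach described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several parts are assumptions made for illustration: a linear softmax scorer stands in for the multi-class DNN (the paper's best model is a Bi-GRU), cosine similarity to a document vector is assumed as the ranking criterion, and all names (`select_classes`, `rank_oovs`, `class_weights`) and all toy values are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_classes(doc_vec, class_weights, top_k):
    """Step 1: score every PN class for the document and keep the top_k.

    Assumption: a linear layer plus softmax replaces the paper's DNN.
    """
    logits = [sum(w * x for w, x in zip(weights, doc_vec))
              for weights in class_weights]
    probs = softmax(logits)
    ranked = sorted(range(len(probs)), key=lambda c: probs[c], reverse=True)
    return ranked[:top_k]


def rank_oovs(doc_vec, selected, class_members, embeddings, n_best):
    """Step 2: rank the OOV PNs of the selected classes.

    Assumption: ranking by cosine similarity to the document vector.
    """
    candidates = [pn for c in selected for pn in class_members[c]]
    candidates.sort(key=lambda pn: cosine(doc_vec, embeddings[pn]),
                    reverse=True)
    return candidates[:n_best]


# Toy data, made up for illustration only.
embeddings = {
    "Macron": [0.9, 0.1],
    "Merkel": [0.8, 0.3],
    "Federer": [0.1, 0.9],
    "Nadal": [0.2, 0.8],
}
class_members = {0: ["Macron", "Merkel"], 1: ["Federer", "Nadal"]}
class_weights = [[1.0, 0.0], [0.0, 1.0]]  # one weight row per class
doc_vec = [1.0, 0.2]  # hypothetical embedding of the document to transcribe

selected = select_classes(doc_vec, class_weights, top_k=1)
lexicon_extension = rank_oovs(doc_vec, selected, class_members,
                              embeddings, n_best=2)
print(selected, lexicon_extension)
```

In this sketch, the top-ranked PNs returned by `rank_oovs` would be the candidates added to the lexicon for the current document, which is what makes the extension dynamic (document-dependent).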

Keywords

word embedding; lexicon extension; out-of-vocabulary words; automatic speech recognition

Domains

Computer Science [cs]

Main file

Abdullah.pdf (584.21 KB)

Origin: Files produced by the author(s)


https://hal.science/hal-01874495

Submitted on: Friday, September 14, 2018, 13:11:57

Last modified on: Monday, September 11, 2023, 17:41:19

Long-term archiving on: Saturday, December 15, 2018, 15:20:47

Dates and versions

hal-01874495, version 1 (14-09-2018)

Identifiers

HAL Id: hal-01874495, version 1

Cite

Badr Abdullah, Irina Illina, Dominique Fohr. Dynamic Extension of ASR Lexicon Using Wikipedia Data. IEEE Workshop on Spoken and Language Technology (SLT), Dec 2018, Athens, Greece. ⟨hal-01874495⟩


Collections

CNRS INRIA GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD IMPACT-OLKI SILECS

174 views

303 downloads
