Detection of OOV words by combining acoustic confidence measures with linguistic features

Frederik Stouten 1, Dominique Fohr 1, Irina Illina 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper describes the design of an out-of-vocabulary (OOV) word detector. The detector is intended to find segments in the output of an LVCSR system that correspond to OOV words, that is, words not included in the lexicon. It uses acoustic confidence measures derived from three systems: a word recognizer constrained by a lexicon, a phone recognizer constrained by a grammar, and a phone recognizer without constraints. In addition, it uses linguistic features. Experimental results on a French broadcast news transcription task show that, with this approach, precision equals recall at 35%.
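
The abstract reports a single operating point, the one where precision equals recall. As a rough illustration of how such a point can be located from per-segment detector scores, here is a minimal Python sketch. The function names, the thresholding rule, and the toy scores and labels are all hypothetical and are not taken from the paper; the paper's own detector and its features are not reproduced here.

```python
from typing import List, Tuple


def precision_recall(scores: List[float], labels: List[bool],
                     threshold: float) -> Tuple[float, float]:
    """Flag a segment as OOV when its score is >= threshold (assumed rule)."""
    tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y)
    fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and not y)
    fn = sum(1 for s, y in zip(scores, labels) if s < threshold and y)
    precision = tp / (tp + fp) if tp + fp else 1.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def equal_precision_recall_point(scores: List[float],
                                 labels: List[bool]) -> Tuple[float, float, float]:
    """Sweep the observed scores as thresholds and return the
    (threshold, precision, recall) with the smallest |precision - recall|."""
    best = None
    for t in sorted(set(scores)):
        p, r = precision_recall(scores, labels, t)
        gap = abs(p - r)
        if best is None or gap < best[0]:
            best = (gap, t, p, r)
    return best[1], best[2], best[3]


# Toy data (hypothetical): higher score = more likely OOV; True = segment is OOV.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
toy_labels = [True, False, True, False, True, False, False, False]

threshold, precision, recall = equal_precision_recall_point(toy_scores, toy_labels)
print(f"threshold={threshold}, precision={precision:.3f}, recall={recall:.3f}")
```

Precision and recall coincide exactly when the number of false alarms equals the number of missed OOV segments, which is why a single threshold sweep is enough to find the point.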
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-00435087
Contributor : Dominique Fohr
Submitted on : Monday, November 23, 2009 - 3:30:13 PM
Last modification on : Thursday, January 11, 2018 - 6:19:56 AM

Identifiers

  • HAL Id : hal-00435087, version 1

Citation

Frederik Stouten, Dominique Fohr, Irina Illina. Detection of OOV words by combining acoustic confidence measures with linguistic features. The eleventh biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), Dec 2009, Merano, Italy. pp.1-4. ⟨hal-00435087⟩
