Cross-Word Sub-Word Units for Low-Resource Keyword Spotting - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Cross-Word Sub-Word Units for Low-Resource Keyword Spotting

Résumé

We investigate the use of sub-word lexical units for the detection of out-of-vocabulary (OOV) keywords in the keyword spotting task. Sub-word units based on morphological decomposition and character ngrams are compared. In particular, we examine the benefit of sub-word units that cross word boundaries. Experiments are performed on the IARPA Babel Turkish dataset. Our results demonstrate that cross-word subword units achieve similar performance on OOV keywords as other types of sub-word units, but can be combined to produce further gains. We also show that sub-word units can be used to improve detection of in-vocabulary keywords. System combination provides a 18\% relative gain in ATWV with the best two systems, and 25\% with the best three systems.
Fichier non déposé

Dates et versions

hal-01843415 , version 1 (18-07-2018)

Identifiants

  • HAL Id : hal-01843415 , version 1

Citer

William Hartmann, Lori Lamel, Jean-Luc Gauvain. Cross-Word Sub-Word Units for Low-Resource Keyword Spotting. International Workshop on Spoken Languages Technologies for Under-resourced languages, May 2014, St. Petersburg, Russia. ⟨hal-01843415⟩
9 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More