Learning with noisy supervision for Spoken Language Understanding - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Learning with noisy supervision for Spoken Language Understanding

Résumé

Data-driven Spoken Language Understanding (SLU) systems need semantically annotated data which are expensive, time consuming and prone to human errors. Active learning has been successfully applied to automatic speech recognition and utterance classification. In general, corpora annotation for SLU involves such tasks as sentence segmentation, chunk-ing or frame labeling and predicate-argument annotation. In such cases human annotations are subject to errors increasing with the annotation complexity. We investigate two alternative noise-robust active learning strategies that are either data-intensive or supervision-intensive. The strategies detect likely erroneous examples and improve significantly the SLU performance for a given labeling cost. We apply uncertainty based active learning with conditional random fields on the concept segmentation task for SLU. We perform annotation experiments on two databases, namely ATIS (English) and Media (French). We show that our noise-robust algorithm could improve the accuracy up to 6% (absolute) depending on the noise level and the labeling cost.

Dates et versions

hal-01314633 , version 1 (11-05-2016)

Identifiants

Citer

Christian Raymond, Giuseppe Riccardi. Learning with noisy supervision for Spoken Language Understanding. IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2008, Las Vegas, United States. ⟨10.1109/ICASSP.2008.4518778⟩. ⟨hal-01314633⟩

Collections

UNIV-AVIGNON LIA
40 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More