ONLINE ADAPTATIVE ZERO-SHOT LEARNING SPOKEN LANGUAGE UNDERSTANDING USING WORD-EMBEDDING

Abstract : Many recent competitive state-of-the-art solutions for understanding of speech data have in common to be probabilistic and to rely on machine learning algorithms to train their models from large amount of data. The difficulty remains in the cost and time of collecting and annotating such data, but also to update the existing models to new conditions, tasks and/or languages. In the present work an approach based on a zero-shot learning method using word embeddings for spoken language understanding is investigated. This approach requires no dedicated data. Large amounts of un-annotated and un-structured found data are used to learn a continuous space vector representation of words, based on neural network ar-chitectures. Only the ontological description of the target domain and the generic word embedding features are then required to derive the model used for decoding. In this paper, we extend this baseline with an online adaptative strategy allowing to refine progressively the initial model with only a light and adjustable supervision. We show that this proposition can significantly improve the performance of the spoken language understanding module on the second Dialog State Tracking Challenge (DSTC2) datasets. Index Terms-Spoken language understanding, word embedding, zero-shot learning, out-of-domain training data, online adaptation.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-02042298
Contributor : Bassam Jabaian <>
Submitted on : Wednesday, February 20, 2019 - 12:17:38 PM
Last modification on : Wednesday, May 15, 2019 - 10:12:03 AM

Identifiers

  • HAL Id : hal-02042298, version 1

Collections

Citation

Emmanuel Ferreira, Bassam Jabaian, Fabrice Lefèvre. ONLINE ADAPTATIVE ZERO-SHOT LEARNING SPOKEN LANGUAGE UNDERSTANDING USING WORD-EMBEDDING. ICASSP, 2015, Brisban, Australia. ⟨hal-02042298⟩

Share

Metrics

Record views

15