ADVERSARIAL BANDIT FOR ONLINE INTERACTIVE ACTIVE LEARNING OF ZERO-SHOT SPOKEN LANGUAGE UNDERSTANDING

Abstract: Many state-of-the-art approaches to speech understanding are probabilistic and rely on machine learning algorithms to train their models from large amounts of data. The difficulty lies in the cost of collecting and annotating such data, as well as in the time needed to adapt an existing model to a new domain. Recent work has shown that a zero-shot learning method can bootstrap a model with good initial performance. To do so, this method exploits both a small ontological description of the target domain and a generic word-embedding semantic space for generalization. This framework was then extended to exploit user feedback in order to refine the zero-shot semantic parser's parameters and improve its performance online. In this paper, we propose to drive this online adaptive process with a policy learnt using the adversarial bandit algorithm Exp3. We show, on the second Dialog State Tracking Challenge (DSTC2) datasets, that this proposal can optimally balance the cost of gathering valuable user feedback against the overall performance of the spoken language understanding module.

Index Terms: Spoken language understanding, zero-shot learning, bandit problem, out-of-domain training data, online adaptation.
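To make the abstract's reference to Exp3 concrete, below is a minimal, self-contained sketch of the standard Exp3 adversarial-bandit update. This is an illustrative assumption, not the authors' implementation: the class name, the gamma parameter, and the interpretation of arms as adaptation actions (e.g. "ask the user for feedback" vs. "skip") are hypothetical; rewards are assumed to lie in [0, 1].

```python
import math
import random

class Exp3:
    """Minimal sketch of the Exp3 adversarial bandit algorithm.

    Arms could represent adaptation actions (e.g. solicit user
    feedback vs. skip); this framing is illustrative only.
    Rewards must lie in [0, 1].
    """

    def __init__(self, n_arms, gamma=0.1):
        self.n_arms = n_arms
        self.gamma = gamma          # exploration rate in (0, 1]
        self.weights = [1.0] * n_arms

    def probabilities(self):
        # Mix the weight-proportional distribution with uniform exploration.
        total = sum(self.weights)
        return [(1.0 - self.gamma) * w / total + self.gamma / self.n_arms
                for w in self.weights]

    def select(self):
        # Draw an arm according to the current mixed distribution.
        return random.choices(range(self.n_arms),
                              weights=self.probabilities())[0]

    def update(self, arm, reward):
        # Importance-weighted reward estimate for the chosen arm only;
        # unobserved arms receive an implicit estimate of zero.
        p = self.probabilities()[arm]
        x_hat = reward / p
        self.weights[arm] *= math.exp(self.gamma * x_hat / self.n_arms)
```

A typical loop would call `select()`, play the chosen action, observe a bounded reward (e.g. a feedback-derived gain minus its solicitation cost, rescaled to [0, 1]), and call `update()`; over time the policy concentrates probability on the action with the best cumulative payoff.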
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-02041621
Contributor : Bassam Jabaian
Submitted on : Wednesday, February 20, 2019 - 12:09:24 PM
Last modification on : Wednesday, May 15, 2019 - 10:12:03 AM

Identifiers

  • HAL Id : hal-02041621, version 1

Citation

Emmanuel Ferreira, Reiffers Masson, Bassam Jabaian, Fabrice Lefèvre. Adversarial Bandit for Online Interactive Active Learning of Zero-Shot Spoken Language Understanding. ICASSP, 2016, Shanghai, China. ⟨hal-02041621⟩
