Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems - Archive ouverte HAL Accéder directement au contenu
Chapitre D'ouvrage Année : 2020

Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems

Résumé

Scarcity of training data for task-oriented dialogue systemsis a well known problem that is usually tackled with costly and time-consuming manual data annotation. An alternative solution is to relyon automatic text generation which, although less accurate than humansupervision, has the advantage of being cheap and fast. Our contributionis twofold. First we show how to optimally train and control the generationof intent-specific sentences using a conditional variational autoencoder.Then we introduce a new protocol calledquery transferthat allows toleverage a large unlabelled dataset, possibly containing irrelevant queries,to extract relevant information. Comparison with two different baselinesshows that this method, in the appropriate regime, consistently improvesthe diversity of the generated queries without compromising their quality.We also demonstrate the effectiveness of our generation method as a dataaugmentation technique for language modelling tasks.

Dates et versions

hal-03116563 , version 1 (20-01-2021)

Identifiants

Citer

Stéphane d'Ascoli, Alice Coucke, Francesco Caltagirone, Alexandre Caulier, Marc Lelarge. Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems. Luis Espinosa-Anke; Carlos Martín-Vide; Irena Spasić. International Conference on Statistical Language and Speech Processing, Springer, pp.23-34, 2020, ⟨10.1007/978-3-030-59430-5_2⟩. ⟨hal-03116563⟩
56 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More