Conference paper, Year: 2020

Hierarchical Pre-training for Sequence Labelling in Spoken Dialog

Abstract

Sequence labelling tasks such as Dialog Act and Emotion/Sentiment identification are a key component of spoken dialog systems. In this work, we propose a new approach to learning generic representations adapted to spoken dialog, which we evaluate on a new benchmark we call the Sequence labellIng evaLuatIon benChmark fOr spoken laNguagE (SILICONE). SILICONE is model-agnostic and contains 10 different datasets of various sizes. We obtain our representations with a hierarchical encoder based on transformer architectures, for which we extend two well-known pre-training objectives. Pre-training is performed on OpenSubtitles, a large corpus of spoken dialog containing over 2.3 billion tokens. We demonstrate that hierarchical encoders achieve competitive results with consistently fewer parameters than state-of-the-art models, and we show their importance for both pre-training and fine-tuning.
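
To make the architecture concrete, below is a minimal sketch of a hierarchical transformer encoder for dialog in the spirit of the one the abstract describes. It is not the authors' implementation: the mean-pooling strategy, the dimensions, and all names (HierarchicalDialogEncoder, d_model, etc.) are illustrative assumptions. A token-level encoder contextualizes the words within each utterance, those token states are pooled into one vector per utterance, and an utterance-level encoder then contextualizes the utterance vectors across the dialog, yielding one representation per utterance for sequence labelling.

```python
# Minimal sketch (an assumption, not the authors' code) of a hierarchical
# transformer encoder: a token-level encoder within each utterance, then
# an utterance-level encoder across the whole dialog.
import torch
import torch.nn as nn

class HierarchicalDialogEncoder(nn.Module):
    def __init__(self, vocab_size=30000, d_model=256, nhead=4, num_layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        token_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.token_encoder = nn.TransformerEncoder(token_layer, num_layers)
        utterance_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.utterance_encoder = nn.TransformerEncoder(utterance_layer, num_layers)

    def forward(self, dialog_tokens):
        # dialog_tokens: (num_utterances, tokens_per_utterance) ids for one dialog.
        token_states = self.token_encoder(self.embed(dialog_tokens))
        # Mean-pool token states into a single vector per utterance.
        utterance_vectors = token_states.mean(dim=1)  # (num_utterances, d_model)
        # Contextualize utterances across the dialog (batch of one dialog).
        return self.utterance_encoder(utterance_vectors.unsqueeze(0)).squeeze(0)

# Toy usage: a dialog of 3 utterances, 8 token ids each.
model = HierarchicalDialogEncoder()
dialog = torch.randint(0, 30000, (3, 8))
print(model(dialog).shape)  # torch.Size([3, 256]): one vector per utterance
```

For the sequence labelling tasks the paper targets, a classification head over these per-utterance outputs would predict one dialog act or emotion/sentiment label per utterance.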

Dates and versions

hal-03134851, version 1 (04-01-2024)

Cite

Emile Chapuis, Pierre Colombo, Matteo Manica, Matthieu Labeau, Chloé Clavel. Hierarchical Pre-training for Sequence Labelling in Spoken Dialog. Findings of the Association for Computational Linguistics: EMNLP 2020, Nov 2020, Online, France. pp.2636-2648, ⟨10.18653/v1/2020.findings-emnlp.239⟩. ⟨hal-03134851⟩