Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

Alexis Conneau; Douwe Kiela; Holger Schwenk; Loïc Barrault; Antoine Bordes

doi:10.18653/v1/D17-1070

Conference paper, Year: 2017

Alexis Conneau

Role: Author

Douwe Kiela

Role: Author

Holger Schwenk

Role: Author
PersonId: 910684

Laboratoire d'Informatique de l'Université du Mans

Loïc Barrault

Role: Author
PersonId: 15276
IdHAL: loicbarrault
ORCID: 0000-0002-0634-6147
IdRef: 131912488

Laboratoire d'Informatique de l'Université du Mans

Antoine Bordes

Role: Author

Abstract

Many modern NLP systems rely on word embeddings, previously trained in an unsupervised manner on large corpora, as base features. Efforts to obtain embeddings for larger chunks of text, such as sentences, have however not been so successful. Several attempts at learning unsupervised representations of sentences have not reached satisfactory enough performance to be widely adopted. In this paper, we show how universal sentence representations trained using the supervised data of the Stanford Natural Language Inference datasets can consistently outperform unsupervised methods like SkipThought vectors on a wide range of transfer tasks. Much like how computer vision uses ImageNet to obtain features, which can then be transferred to other tasks, our work tends to indicate the suitability of natural language inference for transfer learning to other NLP tasks. Our encoder is publicly available.

Domains

Computation and Language [cs.CL]

Contributor: Loïc Barrault

https://hal.science/hal-01897968

Submitted on: Wednesday, October 17, 2018, 18:10:06

Last modified on: Tuesday, March 10, 2020, 11:52:41

Dates and versions

hal-01897968, version 1 (17-10-2018)

Identifiers

HAL Id: hal-01897968, version 1
ARXIV: 1705.02364
DOI: 10.18653/v1/D17-1070

Cite

Alexis Conneau, Douwe Kiela, Holger Schwenk, Loïc Barrault, Antoine Bordes. Supervised Learning of Universal Sentence Representations from Natural Language Inference Data. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Sep 2017, Copenhagen, Denmark. pp.670-680, ⟨10.18653/v1/D17-1070⟩. ⟨hal-01897968⟩

Collections

UNIV-LEMANS LIUM LIUM-LST
