Investigating the stability of concrete nouns in word embeddings

Bénédicte Pierrejean; Ludovic Tanguy

Conference paper, Year: 2019

Bénédicte Pierrejean

Role: Author

Cognition, Langues, Langage, Ergonomie

Ludovic Tanguy

Role: Author
PersonId : 34
IdHAL : ludovic-tanguy
IdRef : 11839777X

Equipe de Recherche en Syntaxe et Sémantique

Abstract

Word embeddings trained using neural-based methods (such as word2vec SGNS) are known to be sensitive to stability problems: across two models trained using the exact same set of parameters, the nearest neighbors of a word are likely to change. Not all words are equally affected by this internal instability, and recent studies have investigated features influencing the stability of word embeddings. This stability can be seen as a clue for the reliability of the semantic representation of a word. In this work, we investigate the influence of the degree of concreteness of nouns on the stability of their semantic representation. We show that for English generic corpora, abstract words are more affected by stability problems than concrete words. We also found that, to a certain extent, the difference between the degree of concreteness of a noun and that of its nearest neighbors can partly explain the stability or instability of its neighbors.
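To make the notion of neighbor instability concrete, the following minimal sketch compares the nearest neighbors of a word in two embedding matrices and reports the share they have in common. It is an illustration only: the top-k overlap measure, the function names `nearest_neighbors` and `neighbor_overlap`, and the toy two-dimensional vectors are all assumptions made here, not the measure or data used by the authors.

```python
import numpy as np


def nearest_neighbors(vectors, words, target, k):
    """Return the set of the k words closest to `target` by cosine similarity."""
    # Normalize rows so that a dot product equals cosine similarity.
    unit = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    idx = words.index(target)
    sims = unit @ unit[idx]
    sims[idx] = -np.inf  # exclude the target word itself
    top = np.argsort(-sims)[:k]
    return {words[i] for i in top}


def neighbor_overlap(model_a, model_b, words, target, k=2):
    """Share of the top-k neighbors of `target` found in both models (0.0 to 1.0)."""
    nn_a = nearest_neighbors(model_a, words, target, k)
    nn_b = nearest_neighbors(model_b, words, target, k)
    return len(nn_a & nn_b) / k


# Hypothetical toy vocabulary and two hand-made "models" (rows align with `words`).
words = ["dog", "cat", "puppy", "idea", "notion"]
model_a = np.array([[1.0, 0.0], [0.9, 0.1], [0.95, 0.05], [0.0, 1.0], [0.1, 0.9]])
model_b = np.array([[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.8, 0.2], [0.1, 0.9]])

overlap_dog = neighbor_overlap(model_a, model_b, words, "dog", k=2)
print(f"Top-2 neighbor overlap for 'dog': {overlap_dog}")
```

In practice the two matrices would come from two training runs with identical parameters, and a low overlap for a given word would indicate an unstable representation under this assumed measure.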

Domains

Linguistics; Computation and Language [cs.CL]

Main file

iwcs.pdf (116.93 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-02073705

Submitted on: Monday, July 8, 2019, 16:03:35

Last modified on: Friday, April 19, 2024, 16:18:56

Dates and versions

hal-02073705, version 1 (08-07-2019)

Identifiers

HAL Id: hal-02073705, version 1

Cite

Bénédicte Pierrejean, Ludovic Tanguy. Investigating the stability of concrete nouns in word embeddings. 13th International Conference on Computational Semantics, May 2019, Gothenburg, Sweden. ⟨hal-02073705⟩

Collections

EPHE UNIV-TLSE2 CNRS CLLE PSL UNIV-BORDEAUX-MONTAIGNE

84 views

134 downloads
