Latent Semantic Word Sense Induction and Disambiguation

Abstract : In this paper, we present a unified model for the automatic induction of word senses from text, and the subsequent disambiguation of particular word instances using the automatically extracted sense inventory. The induction step and the disambiguation step are based on the same principle: words and contexts are mapped to a limited number of topical dimensions in a latent semantic word space. The intuition is that a particular sense is associated with a particular topic, so that different senses can be discriminated through their association with particular topical dimensions; in a similar vein, a particular instance of a word can be disambiguated by determining its most important topical dimensions. The model is evaluated on the SemEval-2010 word sense induction and disambiguation task, on which it reaches state-of-the-art results.
Type de document :
Communication dans un congrès
ACL HLT 2011 - 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Jun 2011, Portland, Oregon, United States. pp.1476--1485, 2011
Liste complète des métadonnées

Littérature citée [25 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00607672
Contributeur : Marianna Apidianaki <>
Soumis le : dimanche 10 juillet 2011 - 18:09:22
Dernière modification le : vendredi 4 janvier 2019 - 17:33:24
Document(s) archivé(s) le : lundi 12 novembre 2012 - 10:35:42

Fichier

Van_de_Cruys_Apidianaki_ACL-HL...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00607672, version 1

Collections

Citation

Tim Van de Cruys, Marianna Apidianaki. Latent Semantic Word Sense Induction and Disambiguation. ACL HLT 2011 - 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Jun 2011, Portland, Oregon, United States. pp.1476--1485, 2011. 〈hal-00607672〉

Partager

Métriques

Consultations de la notice

645

Téléchargements de fichiers

244