Towards a Variability Measure for Multiword Expressions

Abstract: One of the outstanding properties of multi-word expressions (MWEs), especially verbal ones (VMWEs), important both in theoretical models and applications, is their idiosyncratic variability. Some MWEs are always continuous, while some others admit certain types of insertions. Components of some MWEs are rarely or never modified, while some others admit either specific or unrestricted modification. This unpredictable variability profile of MWEs hinders modeling and processing them as "words-with-spaces" on the one hand, and as regular syntactic structures on the other hand. Since variability of MWEs is a matter of scale rather than a binary property, we propose a 2-dimensional language-independent measure of variability dedicated to verbal MWEs based on syntactic and discontinuity-related clues. We assess its relevance with respect to a linguistic benchmark and its utility for the tasks of VMWE classification and variant identification on a French corpus.
Document type:
Conference paper
NAACL, Jun 2018, New Orleans, United States

Cited literature: 19 references

https://hal.archives-ouvertes.fr/hal-01802238
Contributor: Caroline Pasquer
Submitted on: Tuesday, 29 May 2018 - 10:31:12
Last modified on: Tuesday, 9 October 2018 - 11:46:07
Document(s) archived on: Thursday, 30 August 2018 - 13:21:30

File

NAACL_final.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01802238, version 1

Citation

Caroline Pasquer, Agata Savary, Jean-Yves Antoine, Carlos Ramisch. Towards a Variability Measure for Multiword Expressions. NAACL, Jun 2018, New Orleans, United States. 〈hal-01802238〉

Metrics

Record views

69

File downloads

27