Lexical encoding of multiword expressions in XMG - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Lexical encoding of multiword expressions in XMG

Résumé

Multiword expressions (MWEs) exhibit both regular and idiosyncratic properties. Their idiosyncrasy requires lexical encoding in parallel with their component words. Their (at times intricate) regularity, on the other hand, calls for means of flexible factorization to avoid redundant descriptions of shared properties. However, so far, non-redundant general-purpose lexical encoding of MWEs has not received a satisfactory solution. We offer a proof of concept that this challenge might be effectively addressed within eXtensible MetaGrammar (XMG), an object-oriented metagrammar framework. We first make an existing metagrammatical resource, the FrenchTAG grammar, MWE-aware. We then evaluate the factorization gain during incremental implementation with XMG on a dataset extracted from an MWE-annotated reference corpus.
Fichier principal
Vignette du fichier
12.pdf (228.3 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03047145 , version 1 (03-01-2021)

Identifiants

  • HAL Id : hal-03047145 , version 1

Citer

Agata Savary, Simon Petitjean, Timm Lichte, Laura Kallmeyer, Jakub Waszczuk. Lexical encoding of multiword expressions in XMG. 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT), Dec 2020, Montrouge, France. pp.60-63. ⟨hal-03047145⟩

Collections

UNIV-TOURS
31 Consultations
17 Téléchargements

Partager

Gmail Facebook X LinkedIn More