Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions

Bruno Guillaume
Marie Candito
Renata Ramisch
  • Fonction : Auteur
Sara Stymne
  • Fonction : Auteur

Résumé

We present edition 1.2 of the PARSEME shared task on identification of verbal multiword expressions (VMWEs). Lessons learned from previous editions indicate that VMWEs have low ambiguity, and that the major challenge lies in identifying test instances never seen in the training data. Therefore, this edition focuses on unseen VMWEs. We have split annotated corpora so that the test corpora contain around 300 unseen VMWEs, and we provide non-annotated raw corpora to be used by complementary discovery methods. We released annotated and raw corpora in 14 languages, and this semi-supervised challenge attracted 7 teams who submitted 9 system results. This paper describes the effort of corpus creation, the task design, and the results obtained by the participating systems, especially their performance on unseen expressions.
Fichier principal
Vignette du fichier
main-print.pdf (469.34 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03014927 , version 1 (19-11-2020)

Identifiants

  • HAL Id : hal-03014927 , version 1

Citer

Carlos Ramisch, Agata Savary, Bruno Guillaume, Jakub Waszczuk, Marie Candito, et al.. Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions. Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020), 2020, Barcelona, Spain. ⟨hal-03014927⟩
134 Consultations
101 Téléchargements

Partager

Gmail Facebook X LinkedIn More