Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions

Carlos Ramisch; Agata Savary; Bruno Guillaume; Jakub Waszczuk; Marie Candito; Ashwini Vaidya; Verginica Barbu Mititelu; Archna Bhatia; Uxoa Iñurrieta; Voula Giouli; Tunga Güngör; Menghan Jiang Polyu; Timm Lichte; Chaya Liebeskind; Johanna Monti; Renata Ramisch; Sara Stymne; Abigail Walsh; Hongzhi Xu

Communication Dans Un Congrès Année : 2020

Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions

(1) , (2) , (3) , (4) , (5) , (6) , (7) , (8) , (9) , (10) , (11) , (12) , (13) , (14) , (15) , (16) , (17) , (18) , (19)

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19

Carlos Ramisch

Fonction : Auteur
PersonId : 5103
IdHAL : carlos-ramisch
ORCID : 0000-0001-7466-9039
IdRef : 170720802

Traitement Automatique du Langage Ecrit et Parlé

Agata Savary

Fonction : Auteur
PersonId : 4644
IdHAL : agata-savary
IdRef : 113077661

Bases de données et traitement des langues naturelles

Bruno Guillaume

Fonction : Auteur
PersonId : 2082
IdHAL : bruno-guillaume
ORCID : 0000-0001-8314-8075
IdRef : 115863664

Semantic Analysis of Natural Language

Jakub Waszczuk

Fonction : Auteur

Heinrich Heine Universität Düsseldorf = Heinrich Heine University [Düsseldorf]

Marie Candito

Fonction : Auteur
PersonId : 13596
IdHAL : marie-candito
IdRef : 153698616

Laboratoire de Linguistique Formelle

Ashwini Vaidya

Fonction : Auteur

Indian Institute of Technology Delhi

Verginica Barbu Mititelu

Fonction : Auteur

Romanian Academy

Archna Bhatia

Fonction : Auteur

Florida Institute for Human and Machine Cognition [Pensacola]

Uxoa Iñurrieta

Fonction : Auteur

University of the Basque Country = Euskal Herriko Unibertsitatea

Voula Giouli

Fonction : Auteur

ATHENA - Research and Innovation Center in Information, Communication and Knowledge Technologies

Tunga Güngör

Fonction : Auteur

Boǧaziçi üniversitesi = Boğaziçi University [Istanbul]

Menghan Jiang Polyu

Fonction : Auteur

The Hong Kong Polytechnic University [Hong Kong]

Timm Lichte

Fonction : Auteur

University of Tübingen

Chaya Liebeskind

Fonction : Auteur

Jerusalem College of Technology

Johanna Monti

Fonction : Auteur

Università di Napoli L'Orientale = University of Naples

Renata Ramisch

Fonction : Auteur

NILC, UFSCar

Sara Stymne

Fonction : Auteur

Uppsala University

Abigail Walsh

Fonction : Auteur

Dublin City University [Dublin]

Hongzhi Xu

Fonction : Auteur

Shanghai International Studies University

Résumé

We present edition 1.2 of the PARSEME shared task on identification of verbal multiword expressions (VMWEs). Lessons learned from previous editions indicate that VMWEs have low ambiguity, and that the major challenge lies in identifying test instances never seen in the training data. Therefore, this edition focuses on unseen VMWEs. We have split annotated corpora so that the test corpora contain around 300 unseen VMWEs, and we provide non-annotated raw corpora to be used by complementary discovery methods. We released annotated and raw corpora in 14 languages, and this semi-supervised challenge attracted 7 teams who submitted 9 system results. This paper describes the effort of corpus creation, the task design, and the results obtained by the participating systems, especially their performance on unseen expressions.

Domaines

Informatique et langage [cs.CL]

Fichier principal

main-print.pdf (469.34 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Agata Savary : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03014927

Soumis le : jeudi 19 novembre 2020-16:30:01

Dernière modification le : vendredi 22 mars 2024-18:24:04

Archivage à long terme le : samedi 20 février 2021-20:20:20

Dates et versions

hal-03014927 , version 1 (19-11-2020)

Identifiants

HAL Id : hal-03014927 , version 1

Citer

Carlos Ramisch, Agata Savary, Bruno Guillaume, Jakub Waszczuk, Marie Candito, et al.. Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions. Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020), 2020, Barcelona, Spain. ⟨hal-03014927⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UNIV-TLN UNIV-TOURS CNRS INRIA UNIV-AMU IRISA LLF LIBDTLN UNIV-LORRAINE INRIA2 CAMPUS-AAR AAI LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC LIS-LAB UNIV-RENNES LIFAT INSA-GROUPE INSA-CVL UP-SOCIETES-HUMANITES ANR UR1-MATH-NUM INCIAM INRIA-BRASIL

134 Consultations

101 Téléchargements

Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager