Conference paper, Year: 2020

Let’s Stop Incorrect Comparisons in End-to-end Relation Extraction!

Abstract

Despite efforts to distinguish three different evaluation setups (Bekoulis et al., 2018), numerous end-to-end Relation Extraction (RE) articles present unreliable performance comparisons to previous work. In this paper, we first identify several patterns of invalid comparison in published papers and describe them to avoid their propagation. We then propose a small empirical study to quantify the impact of the most common mistake and show that it leads to overestimating final RE performance by around 5% on ACE05. We also seize this opportunity to study the unexplored ablations of two recent developments: the use of language model pretraining (specifically BERT) and span-level NER. This meta-analysis emphasizes the need for rigor in reporting both the evaluation setting and the dataset statistics. We finally call for unifying the evaluation setting in end-to-end RE.
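
To make the comparison pitfall concrete, here is a minimal sketch of how a Strict versus a Boundaries relation match can diverge, following the evaluation setups of Bekoulis et al. (2018) that the paper discusses. The Entity and Relation containers and the relation_match helper are illustrative assumptions, not the paper's code; the point is that a score computed under Boundaries (argument spans only) is not comparable to one computed under Strict (spans and entity types).

```python
from typing import NamedTuple

class Entity(NamedTuple):          # hypothetical container, not the paper's code
    start: int                     # token-level start offset
    end: int                       # token-level end offset (exclusive)
    type: str                      # entity type, e.g. "PER"

class Relation(NamedTuple):        # hypothetical container
    head: Entity
    tail: Entity
    type: str                      # relation type, e.g. "PHYS"

def relation_match(pred: Relation, gold: Relation, setup: str) -> bool:
    """True if the predicted relation counts as correct under the given setup."""
    def arg_match(p: Entity, g: Entity) -> bool:
        same_span = (p.start, p.end) == (g.start, g.end)
        if setup == "strict":      # Strict: spans AND entity types must match
            return same_span and p.type == g.type
        if setup == "boundaries":  # Boundaries: spans only, types ignored
            return same_span
        raise ValueError(f"unknown setup: {setup!r}")
    return (pred.type == gold.type
            and arg_match(pred.head, gold.head)
            and arg_match(pred.tail, gold.tail))

# A prediction with correct spans but a wrong entity type on one argument:
gold = Relation(Entity(0, 2, "PER"), Entity(5, 6, "GPE"), "PHYS")
pred = Relation(Entity(0, 2, "PER"), Entity(5, 6, "LOC"), "PHYS")
print(relation_match(pred, gold, "boundaries"))  # True  -> counted as correct
print(relation_match(pred, gold, "strict"))      # False -> counted as an error
```

Comparing a system scored under the first criterion against one scored under the second is exactly the kind of invalid comparison the paper warns against, and the mismatch it quantifies at around 5% on ACE05.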

Dates and versions

hal-03480380, version 1 (14-12-2021)

Identifiers

HAL Id: hal-03480380
DOI: 10.18653/v1/2020.emnlp-main.301

Cite

Bruno Taillé, Vincent Guigue, Geoffrey Scoutheeten, Patrick Gallinari. Let’s Stop Incorrect Comparisons in End-to-end Relation Extraction!. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Nov 2020, Punta Cana (Online), Dominican Republic. pp.3689-3701, ⟨10.18653/v1/2020.emnlp-main.301⟩. ⟨hal-03480380⟩
