Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign

Karen Fort; Claire François; Olivier Galibert; Maha Ghribi

Conference paper, Year: 2012

Karen Fort

Role: Author
PersonId: 2215
IdHAL: karen-fort
ORCID: 0000-0002-0723-8850
IdRef: 176299548

Laboratoire d'Informatique de Paris-Nord

Claire François

Role: Author
PersonId: 835424

Institut de l'information scientifique et technique

Olivier Galibert

Role: Author
PersonId: 1034095
IdRef: 136783457

Laboratoire commun de métrologie LNE-CNAM

Maha Ghribi

Role: Author
PersonId: 865717

Institut de l'information scientifique et technique

Abstract

This article describes work to evaluate the quality of the manual annotation of gene renaming pairs in scientific abstracts, a task that generates sparse annotations. To evaluate these annotations, we compare the results obtained with commonly advocated inter-annotator agreement coefficients such as S, κ and π, the lesser-known R, the weighted coefficients κω and α, as well as the F-measure and the SER. We analyze the extent to which they are relevant for our data. We then study the bias introduced by prevalence by changing the way the contingency table is built. Finally, we propose an original way to synthesize the results by computing distances between categories, based on the produced annotations.
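To make the prevalence effect concrete, here is a minimal sketch, assuming two annotators and the standard textbook definitions of S, π and κ, which differ only in how expected agreement is estimated. The function name `agreement_coefficients` and both contingency tables are hypothetical and invented for illustration; they are not taken from the paper's data, and this is not presented as the authors' procedure.

```python
def agreement_coefficients(table):
    """Return S, pi and kappa for a square two-annotator contingency table.

    table[i][j] = number of items annotator A put in category i
    and annotator B put in category j.
    """
    k = len(table)
    n = sum(sum(row) for row in table)
    # Observed agreement: proportion of items on the diagonal.
    observed = sum(table[i][i] for i in range(k)) / n
    # Marginal category proportions for each annotator.
    row_p = [sum(table[i]) / n for i in range(k)]
    col_p = [sum(table[i][j] for i in range(k)) / n for j in range(k)]
    expected = {
        # S: all categories assumed equally likely.
        "S": 1.0 / k,
        # pi: one shared distribution, averaged over both annotators.
        "pi": sum(((r + c) / 2) ** 2 for r, c in zip(row_p, col_p)),
        # kappa: each annotator keeps their own distribution.
        "kappa": sum(r * c for r, c in zip(row_p, col_p)),
    }
    return {name: (observed - e) / (1 - e) for name, e in expected.items()}


# Hypothetical toy tables (not from the paper): both have 94% observed
# agreement, but the first is dominated by a single prevalent category.
skewed = [[2, 3], [3, 92]]
balanced = [[47, 3], [3, 47]]

skewed_scores = agreement_coefficients(skewed)
balanced_scores = agreement_coefficients(balanced)

for name in ("S", "pi", "kappa"):
    print(f"{name}: skewed={skewed_scores[name]:.3f} "
          f"balanced={balanced_scores[name]:.3f}")
```

With these toy numbers, S is the same for both tables because it ignores the category distribution, while π and κ drop sharply on the skewed table even though the raw agreement is identical.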

Keywords

Manual annotation evaluation; Inter-annotator agreement; Prevalence

Domains

Document and Text Processing

Main file

lrec-iaa2012_VFinale_Submitted.pdf (115.24 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-00709174

Submitted on: Monday, June 18, 2012, 10:01:12

Last modified on: Wednesday, May 17, 2023, 16:12:49

Long-term archiving on: Wednesday, September 19, 2012, 02:31:18

Dates and versions

hal-00709174, version 1 (18-06-2012)

License

Attribution

Identifiers

HAL Id: hal-00709174, version 1

Cite

Karen Fort, Claire François, Olivier Galibert, Maha Ghribi. Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign. International Conference on Language Resources and Evaluation (LREC), May 2012, Istanbul, Turkey. ⟨hal-00709174⟩

Collections

UNIV-PARIS13 CNRS CNAM QUAERO LIPN LNE GALILE LNE-CNAM SORBONNE-PARIS-NORD INIST HESAM

197 views

264 downloads
