Proposal for an Extension of Traditional Named Entitites: from Guidelines to Evaluation, an Overview

Abstract : Within the framework of the construction of a fact database, we defined guidelines to extract named entities, using a taxonomy based on an extension of the usual named entities defini- tion. We thus defined new types of entities with broader coverage including substantive- based expressions. These extended named en- tities are hierarchical (with types and compo- nents) and compositional (with recursive type inclusion and metonymy annotation). Human annotators used these guidelines to annotate a 1.3M word broadcast news corpus in French. This article presents the definition and novelty of extended named entity annotation guide- lines, the human annotation of a global corpus and of a mini reference corpus, and the evalu- ation of annotations through the computation of inter-annotator agreement. Finally, we dis- cuss our approach and the computed results, and outline further work.
Type de document :
Communication dans un congrès
5th Linguistics Annotation Workshop (The LAW V), Jun 2011, Portland, United States. pp.92--100, 2011
Liste complète des métadonnées

Littérature citée [27 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00604369
Contributeur : Karën Fort <>
Soumis le : mardi 28 juin 2011 - 17:26:20
Dernière modification le : mardi 15 janvier 2019 - 14:54:16
Document(s) archivé(s) le : lundi 12 novembre 2012 - 09:46:01

Fichier

grouin2011law_final.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00604369, version 1

Collections

Citation

Cyril Grouin, Sophie Rosset, Pierre Zweigenbaum, Karën Fort, Olivier Galibert, et al.. Proposal for an Extension of Traditional Named Entitites: from Guidelines to Evaluation, an Overview. 5th Linguistics Annotation Workshop (The LAW V), Jun 2011, Portland, United States. pp.92--100, 2011. 〈hal-00604369〉

Partager

Métriques

Consultations de la notice

1054

Téléchargements de fichiers

443