Tree Representations in Probabilistic Models for Extended Named Entities Detection - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Tree Representations in Probabilistic Models for Extended Named Entities Detection

Résumé

In this paper we deal with Named Entity Recognition (NER) on transcriptions of French broadcast data. Two aspects make the task more difficult with respect to previous NER tasks: i) named entities annotated used in this work have a tree structure, thus the task cannot be tackled as a sequence labelling task; ii) the data used are more noisy than data used for previous NER tasks. We approach the task in two steps, involving Conditional Random Fields and Probabilistic Context-Free Grammars, integrated in a single parsing algorithm. We analyse the effect of using several tree representations. Our system outperforms the best system of the evaluation campaign by a significant margin.
Fichier principal
Vignette du fichier
EACL2012_DinarelliRosset_XNER.pdf (481.55 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01490007 , version 1 (15-03-2017)

Identifiants

  • HAL Id : hal-01490007 , version 1

Citer

Marco Dinarelli, Sophie Rosset. Tree Representations in Probabilistic Models for Extended Named Entities Detection. European Chapter of the Association for Computational Linguistics, Apr 2012, Avignon, France. ⟨hal-01490007⟩
74 Consultations
52 Téléchargements

Partager

Gmail Facebook X LinkedIn More