Structured Named Entities in two distinct press corpora: Contemporary Broadcast News and Old Newspapers

Abstract : This paper compares the reference annotation of structured named entities in two corpora with different origins and properties. It ad- dresses two questions linked to such a comparison. On the one hand, what specific issues were raised by reusing the same annotation scheme on a corpus that differs from the first in terms of media and that predates it by more than a century? On the other hand, what contrasts were observed in the resulting annotations across the two corpora?
Type de document :
Communication dans un congrès
6th Linguistics Annotation Workshop (The LAW VI), Jul 2012, Jeju, South Korea. pp.40-48, 2012
Liste complète des métadonnées

Littérature citée [17 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00709193
Contributeur : Karën Fort <>
Soumis le : lundi 18 juin 2012 - 10:37:33
Dernière modification le : mardi 15 janvier 2019 - 14:54:16
Document(s) archivé(s) le : jeudi 15 décembre 2016 - 15:37:50

Fichier

base2012law_finalefinale.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00709193, version 1

Collections

Citation

Sophie Rosset, Cyril Grouin, Karën Fort, Olivier Galibert, Juliette Kahn, et al.. Structured Named Entities in two distinct press corpora: Contemporary Broadcast News and Old Newspapers. 6th Linguistics Annotation Workshop (The LAW VI), Jul 2012, Jeju, South Korea. pp.40-48, 2012. 〈hal-00709193〉

Partager

Métriques

Consultations de la notice

226

Téléchargements de fichiers

434