Structured Named Entities in two distinct press corpora: Contemporary Broadcast News and Old Newspapers

Abstract : This paper compares the reference annotation of structured named entities in two corpora with different origins and properties. It ad- dresses two questions linked to such a comparison. On the one hand, what specific issues were raised by reusing the same annotation scheme on a corpus that differs from the first in terms of media and that predates it by more than a century? On the other hand, what contrasts were observed in the resulting annotations across the two corpora?
Document type :
Conference papers
Complete list of metadatas

Cited literature [17 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00709193
Contributor : Karën Fort <>
Submitted on : Monday, June 18, 2012 - 10:37:33 AM
Last modification on : Saturday, February 15, 2020 - 1:49:54 AM
Long-term archiving on: Thursday, December 15, 2016 - 3:37:50 PM

File

base2012law_finalefinale.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00709193, version 1

Citation

Sophie Rosset, Cyril Grouin, Karen Fort, Olivier Galibert, Juliette Kahn, et al.. Structured Named Entities in two distinct press corpora: Contemporary Broadcast News and Old Newspapers. 6th Linguistics Annotation Workshop (The LAW VI), Jul 2012, Jeju, South Korea. pp.40-48. ⟨hal-00709193⟩

Share

Metrics

Record views

278

Files downloads

503