Skip to Main content Skip to Navigation
Conference papers

A comparison of sequential and combined approaches for named entity recognition in a corpus of handwritten medieval charters

Abstract : This paper introduces a new corpus of multilin-gual medieval handwritten charter images, annotated with fulltranscription and named entities. The corpus is used to com-pare two approaches for named entity recognition in historicaldocument images in several languages: on the one hand, asequential approach, more commonly used, that sequentiallyapplies handwritten text recognition (HTR) and named entityrecognition (NER), on the other hand, a combined approachthat simultaneously transcribes the image text line and extractsthe entities. Experiments conducted on the charter corpus inLatin, early new high German and old Czech for name, dateand location recognition demonstrate a superior performance ofthe combined approach.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-02935087
Contributor : Dominique Stutzmann <>
Submitted on : Thursday, September 10, 2020 - 8:43:10 AM
Last modification on : Friday, September 11, 2020 - 7:44:43 AM

Identifiers

Collections

Citation

Emanuela Boroş, Verónica Romero, Martin Maarand, Dominique Stutzmann, Kateřina Zenklová, et al.. A comparison of sequential and combined approaches for named entity recognition in a corpus of handwritten medieval charters. 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), Sep 2020, Dortmund, Germany. pp.79-84, ⟨10.1109/ICFHR2020.2020.00025⟩. ⟨hal-02935087⟩

Share

Metrics

Record views

34