Conference paper, Year: 2020

A comparison of sequential and combined approaches for named entity recognition in a corpus of handwritten medieval charters

Abstract

This paper introduces a new corpus of multilingual medieval handwritten charter images, annotated with full transcription and named entities. The corpus is used to compare two approaches for named entity recognition in historical document images in several languages: on the one hand, a sequential approach, more commonly used, that sequentially applies handwritten text recognition (HTR) and named entity recognition (NER); on the other hand, a combined approach that simultaneously transcribes the image text line and extracts the entities. Experiments conducted on the charter corpus in Latin, Early New High German and Old Czech for name, date and location recognition demonstrate a superior performance of the combined approach.
No file deposited

Dates and versions

hal-02935087, version 1 (10-09-2020)

Cite

Emanuela Boroş, Verónica Romero, Martin Maarand, Dominique Stutzmann, Kateřina Zenklová, et al. A comparison of sequential and combined approaches for named entity recognition in a corpus of handwritten medieval charters. 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), Sep 2020, Dortmund, Germany. pp. 79-84, ⟨10.1109/ICFHR2020.2020.00025⟩. ⟨hal-02935087⟩