Effects of Clustering Algorithms on Typographic Reconstruction

Elisa H. Barney Smith 1, * Bart Lamiroy 2, *
* Auteur correspondant
2 QGAR - Querying Graphics through Analysis and Recognition
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : Type designers and historians studying the type-faces and fonts used in historical documents can usually only rely on available printed material. The initial wooden or metal cast fonts have mostly disappeared. In this paper we address the creation of character templates from printed documents. Images of characters scanned from Renaissance era documents are segmented, then clustered and a template is created from each obtained cluster of similar appearance characters. In order for subsequent typeface analysis tools to operate, the template should reduce the noise present in the individual instances by using information from the set of samples, but the samples must be homogeneous enough to not introduce further noise into the process. This paper evaluates the efficiency of several clustering algorithms and the associated parameters through cluster validity statistics and appearance of the resulting template image. Clustering algorithms that form tight clusters produce templates that highlight details, even though the number of available samples is smaller, while algorithms with larger clusters better capture the global shape of the characters.
Type de document :
Communication dans un congrès
13th International Conference on Document Analysis and Recognition, Aug 2015, Nancy, France. 〈http://2015.icdar.org/〉
Liste complète des métadonnées

Littérature citée [11 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01154603
Contributeur : Bart Lamiroy <>
Soumis le : vendredi 22 mai 2015 - 15:23:27
Dernière modification le : mardi 18 décembre 2018 - 16:38:34
Document(s) archivé(s) le : jeudi 20 avril 2017 - 07:15:01

Fichier

TypographyICDAR.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : hal-01154603, version 1

Collections

Citation

Elisa H. Barney Smith, Bart Lamiroy. Effects of Clustering Algorithms on Typographic Reconstruction. 13th International Conference on Document Analysis and Recognition, Aug 2015, Nancy, France. 〈http://2015.icdar.org/〉. 〈hal-01154603〉

Partager

Métriques

Consultations de la notice

309

Téléchargements de fichiers

153