Segmentation-free speech text recognition for comic books

Abstract : Speech text in comic books is written in a particular manner by the scriptwriter which raises unusual challenges for text recognition. We first detail these challenges and present different approaches to solve them. We compare the performances of pre-trained OCR and segmentation-free approach for speech text of comic books written in Latin script. We demonstrate that few good quality pre-trained OCR output samples, associated with other unlabeled data with the same writing style, can feed a segmentation-free OCR and improve text recognition. Thanks to the help of the lexi-cality measure that automatically accept or reject the pre-trained OCR output as pseudo ground truth for a subsequent segmentation-free OCR training and recognition.
Type de document :
Communication dans un congrès
2nd International Workshop on coMics Analysis, Processing, and Understanding (MANPU), Nov 2017, Kyoto, Japan. IEEE, 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), 2017. 〈10.1109/ICDAR.2017.288〉
Liste complète des métadonnées

Littérature citée [17 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01719619
Contributeur : Christophe Rigaud <>
Soumis le : vendredi 2 mars 2018 - 09:52:58
Dernière modification le : samedi 3 mars 2018 - 01:01:08
Document(s) archivé(s) le : jeudi 31 mai 2018 - 13:00:59

Identifiants

Collections

Citation

Christophe Rigaud, Jean-Christophe Burie, Jean-Marc Ogier. Segmentation-free speech text recognition for comic books. 2nd International Workshop on coMics Analysis, Processing, and Understanding (MANPU), Nov 2017, Kyoto, Japan. IEEE, 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), 2017. 〈10.1109/ICDAR.2017.288〉. 〈hal-01719619〉

Partager

Métriques

Consultations de la notice

30

Téléchargements de fichiers

64