Segmentation-free speech text recognition for comic books

Christophe Rigaud; Jean-Christophe Burie; Jean-Marc Ogier

doi:10.1109/ICDAR.2017.288

Communication Dans Un Congrès Année : 2017

Segmentation-free speech text recognition for comic books

Reconnaissance du texte de bandes dessinées sans segmentation

(1) , (1) , (1)

Christophe Rigaud

Fonction : Auteur

Laboratoire Informatique, Image et Interaction - EA 2118

Jean-Christophe Burie

Fonction : Auteur
PersonId : 735515
IdHAL : jean-christophe-burie
ORCID : 0000-0001-7323-2855
IdRef : 119612151

Laboratoire Informatique, Image et Interaction - EA 2118

Jean-Marc Ogier

Fonction : Auteur
PersonId : 833747

Laboratoire Informatique, Image et Interaction - EA 2118

Résumé

Speech text in comic books is written in a particular manner by the scriptwriter which raises unusual challenges for text recognition. We first detail these challenges and present different approaches to solve them. We compare the performances of pre-trained OCR and segmentation-free approach for speech text of comic books written in Latin script. We demonstrate that few good quality pre-trained OCR output samples, associated with other unlabeled data with the same writing style, can feed a segmentation-free OCR and improve text recognition. Thanks to the help of the lexi-cality measure that automatically accept or reject the pre-trained OCR output as pseudo ground truth for a subsequent segmentation-free OCR training and recognition.

Mots clés

Text recognition pseudo ground truth segmentation-free OCR comic book image analysis

Domaines

Traitement des images [eess.IV] Traitement du texte et du document

Fichier principal

segmentation-free-speech.pdf (261.81 Ko)

Christophe Rigaud : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01719619

Soumis le : vendredi 2 mars 2018-09:52:58

Dernière modification le : jeudi 12 mai 2022-15:37:34

Archivage à long terme le : jeudi 31 mai 2018-13:00:59

Dates et versions

hal-01719619 , version 1 (02-03-2018)

Identifiants

HAL Id : hal-01719619 , version 1
DOI : 10.1109/ICDAR.2017.288

Citer

Christophe Rigaud, Jean-Christophe Burie, Jean-Marc Ogier. Segmentation-free speech text recognition for comic books. 2nd International Workshop on coMics Analysis, Processing, and Understanding (MANPU), Nov 2017, Kyoto, Japan. ⟨10.1109/ICDAR.2017.288⟩. ⟨hal-01719619⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

L3I UNIV-ROCHELLE

64 Consultations

401 Téléchargements

Segmentation-free speech text recognition for comic books

Reconnaissance du texte de bandes dessinées sans segmentation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager