Robust frame and text extraction from comic books

Abstract : Comic books constitute an important heritage in many countries. Nowadays, digitization allows to search directly from content instead of metadata only (e.g. album title or author name). Few studies have been done in this direction. Only frame and speech balloon extraction have been experimented in the case of simple page structure. In fact, the page structure depends on the author which is why many different structures and drawings exist. Despite the differences, drawings have a common characteristic because of design process: they are all surrounded by a black line. In this paper, we propose to rely on this particularity of comic books to automatically extract frame and text using a connected-component labeling analysis. The approach is compared with some existing methods found in the literature and results are presented.
Document type :
Journal articles
Complete list of metadatas

Cited literature [13 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00841493
Contributor : Christophe Rigaud <>
Submitted on : Friday, July 5, 2013 - 10:44:42 AM
Last modification on : Thursday, October 3, 2019 - 4:22:03 PM
Long-term archiving on : Sunday, October 6, 2013 - 4:12:26 AM

File

2013_Rigaud_Robust_frame_and_t...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00841493, version 1

Collections

Citation

Christophe Rigaud, Norbert Tsopze, Jean-Christophe Burie, Jean-Marc Ogier. Robust frame and text extraction from comic books. Graphics Recognition. New Trends and Challenges Lecture Notes in Computer Science, 2013, 7423, pp.129-138. ⟨hal-00841493⟩

Share

Metrics

Record views

242

Files downloads

1242