SnooperText: A Text Detection System for Automatic Indexing of Urban Scenes

Abstract : We describe SnooperText, an original detector for textual information embedded in photos of building façades (such as names of stores, products and services) that we developed for the iTowns urban geographic information project. SnooperText locates candidate characters by using toggle-mapping image segmentation and character/non-character classification based on shape descriptors. The candidate characters are then grouped to form either candidate words or candidate text lines. These candidate regions are then validated by a text/non-text classifier using a HOG-based descriptor specifically tuned to single-line text regions. These operations are applied at multiple image scales in order to suppress irrelevant detail in character shapes and to avoid the use of overly large kernels in the segmentation. We show that SnooperText outperforms other published state-of-the-art text detection algorithms on standard image benchmarks. We also describe two metrics to evaluate the end-to-end performance of text extraction systems, and show that the use of SnooperText as a pre-filter significantly improves the performance of a general-purpose OCR algorithm when applied to photos of urban scenes.
Type de document :
Article dans une revue
Computer Vision and Image Understanding, Elsevier, 2014, 122, pp.92-104. 〈10.1016/j.cviu.2013.10.004〉
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01185469
Contributeur : Lip6 Publications <>
Soumis le : jeudi 20 août 2015 - 11:41:27
Dernière modification le : jeudi 22 novembre 2018 - 14:53:32

Identifiants

Collections

Citation

Rodrigo Minetto, Nicolas Thome, Matthieu Cord, Neucimar J. Leite, Jorge Stolfi. SnooperText: A Text Detection System for Automatic Indexing of Urban Scenes. Computer Vision and Image Understanding, Elsevier, 2014, 122, pp.92-104. 〈10.1016/j.cviu.2013.10.004〉. 〈hal-01185469〉

Partager

Métriques

Consultations de la notice

88