Conference paper, Year: 2015

A Vectorization and Decision Tree Based Text-Graphics Separation Algorithm for Bangla Maps

Abstract

This paper proposes a technique for text-graphics separation in geographical maps, based on vectorization and decision tree classification. In the proposed method, every map image is vectorized in order to extract a set of features that characterize text and graphics. Vectorization provides structural primitives, and features are associated with these primitives. A decision tree is then designed to discriminate between text and graphics in map images, using the features extracted from the vectorized images. The method provides a binary decision for every vectorized component, classifying each component as either graphics or text. The proposed method was tested on a dataset of maps in Bangla (a popular Indian regional language), composed of grey-level images. On these map images, it achieves character-level and word-level text extraction accuracies of 72.6% and 67.01%, respectively.
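The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not list the features or the tree construction used, so the three features below (primitive length, number of segments, straightness), the toy training values, and the Gini-based splitting are all assumptions chosen for illustration only.

```python
from collections import Counter

# Hypothetical features per vectorized primitive (NOT from the paper):
# (length in pixels, number of segments, straightness in [0, 1]).
TRAIN = [
    ((12, 5, 0.40), "text"), ((9, 6, 0.35), "text"),
    ((15, 4, 0.50), "text"), ((11, 7, 0.30), "text"),
    ((120, 1, 0.98), "graphics"), ((85, 2, 0.90), "graphics"),
    ((200, 1, 0.99), "graphics"), ((60, 2, 0.85), "graphics"),
]

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Return (score, feature, threshold) with the lowest weighted Gini, or None."""
    best = None
    n = len(rows)
    for f in range(len(rows[0])):
        values = sorted({r[f] for r in rows})
        # Candidate thresholds are midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or score < best[0]:
                best = (score, f, t)
    return best

def build_tree(rows, labels, depth=0, max_depth=3):
    """Grow a binary tree recursively; a leaf is simply the majority label."""
    majority = Counter(labels).most_common(1)[0][0]
    if depth >= max_depth or len(set(labels)) == 1:
        return majority
    split = best_split(rows, labels)
    if split is None:  # no feature has two distinct values left
        return majority
    _, f, t = split
    left = [(r, y) for r, y in zip(rows, labels) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, labels) if r[f] > t]
    return {
        "feature": f,
        "threshold": t,
        "left": build_tree([r for r, _ in left], [y for _, y in left],
                           depth + 1, max_depth),
        "right": build_tree([r for r, _ in right], [y for _, y in right],
                            depth + 1, max_depth),
    }

def classify(tree, row):
    """Walk the tree to a leaf and return its binary label."""
    while isinstance(tree, dict):
        branch = "left" if row[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree

tree = build_tree([r for r, _ in TRAIN], [y for _, y in TRAIN])
print(classify(tree, (10, 6, 0.30)))    # a short, curvy primitive
print(classify(tree, (150, 1, 0.97)))   # a long, straight primitive
```

In this toy setting a single split on primitive length already separates the two classes. Real map data would presumably need more features and a deeper tree, which is consistent with the reported accuracies being well below 100%.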
File not deposited

Dates and versions

hal-01196172, version 1 (09-09-2015)

Identifiers

  • HAL Id: hal-01196172, version 1

Cite

A. Tarafdar, Umapada Pal, Jean-Yves Ramel, Nicolas Ragot, Alireza Alaei. A Vectorization and Decision Tree Based Text-Graphics Separation Algorithm for Bangla Maps. 11th IAPR International Workshop on Graphics Recognition (GREC’15), Aug 2015, Nancy, France. ⟨hal-01196172⟩