Improved Shape Code Based Word Matching For Multi-script Documents - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Improved Shape Code Based Word Matching For Multi-script Documents

Résumé

In this paper, we propose a shape code based word-image matching (word-spotting) technique for word retrieval in multilingual documents, written in Indian languages. Each query word image to be searched is represented by a sequence of shape codes that corresponds to primitives. Then an inexact string matching technique is applied for measuring the similarity between the codes generated from the query word image and each candidate word images, obtained from the document. Based on the similarity score, we retrieve the document where the query image is found. Experimental results on Bangla, Devanagari scripts document image databases confirms the feasibility and efficiency of our proposed approach.
Fichier non déposé

Dates et versions

hal-01269783 , version 1 (05-02-2016)

Identifiants

  • HAL Id : hal-01269783 , version 1

Citer

Tanmoy Mondal, Nicolas Ragot, Jean-Yves Ramel, Umapada Pal. Improved Shape Code Based Word Matching For Multi-script Documents. IAPR Asian Conference on Pattern Recognition (ACPR2015), Nov 2015, KUALA LUMPUR, Malaysia. ⟨hal-01269783⟩
57 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More