Exemplary Sequence Cardinality: An Effective Application for Word Spotting

Abstract : In this paper, a new sequence matching algorithm called as Exemplary Sequence Cardinality (ESC) is proposed. ESC combines several abilities of other sequence matching algo- rithms e.g. DTW, SSDTW, CDP, FSM, MVM, OSB1. Depending on the application domain, ESC can be tuned to behave such as these different sequence matching algorithms. Its generality and robustness comes from its ability to find subsequences (as in CDP and SSDTW), to skip outliers inside the target sequences (as in MVM) and also in the query sequence (as in OSB) and it has the ability to have many to one and one to many correspondences (as in DTW) between the elements of the query and the target sequences. In case of word spotting application, the outliers skipping capability of ESC makes it less sensible to local variations in the spelling of words, and also to noise present in the query and/or in the target word images. Due to it’s capability of sub-sequence matching, the ESC algorithm has the ability to retrieve a query inside a line or piece of line. Finally, its multiple matching facilities (many to one and one to many matching) is proven to be well advantageous in case of different length of target and query sequences due to the variability in scale, font, type/size factors. During experiments, we show the interest of proposed ESC algorithm for the task of word spotting in historical document images in specific cases when incorrect word segmentation and word level local variations occur regularly.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01191704
Contributor : Nicolas Ragot <>
Submitted on : Wednesday, September 2, 2015 - 1:39:14 PM
Last modification on : Saturday, October 26, 2019 - 2:05:56 AM

Identifiers

  • HAL Id : hal-01191704, version 1

Citation

Tanmoy Mondal, Nicolas Ragot, Jean-Yves Ramel, Umapada Pal. Exemplary Sequence Cardinality: An Effective Application for Word Spotting. 13th International Conference on Document Analysis and Recognition (ICDAR 2015), Aug 2015, Nancy, France. ⟨hal-01191704⟩

Share

Metrics

Record views

109