Symbol Knowledge Extraction from a Simple Graphical Language

Abstract : In this paper, we study the problem of symbol knowledge extraction. We assume that some unknown symbols are used to compose a handwritten message, and from a dataset of handwritten samples, we would like to recover the symbol set used in the corresponding language. We applied our approach on online handwriting, and select the domain of numerical expressions, mixing digits and operators, to test the ability to retrieve the corresponding symbol classes. The proposed method is based on three steps: a quantization of the stroke space, a description of the layout of strokes with a relational graph, and the extraction of an optimal lexicon using a minimum description length algorithm. At the symbol level, a recall rate of 74% is obtained on the test dataset produced by 100 writers.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-00615208
Contributor : Harold Mouchère <>
Submitted on : Thursday, August 18, 2011 - 11:31:42 AM
Last modification on : Wednesday, December 19, 2018 - 3:02:03 PM

Identifiers

  • HAL Id : hal-00615208, version 1

Collections

Citation

Jinpeng Li, Harold Mouchère, Christian Viard-Gaudin. Symbol Knowledge Extraction from a Simple Graphical Language. 11th International Conference on Document Analysis and Recognition, ICDAR 2011, Sep 2011, Beijing, China. ⟨hal-00615208⟩

Share

Metrics

Record views

187