A syntax-directed method for numerical field extraction using classifier combination
Résumé
In this article, we propose a method for the automatic extraction of numerical fields in handwritten documents. The method exploits the syntax of a numerical field as an a priori knowledge to extract the connected component sequences from the document. For that, we have to label the connected components as “belonging to a numerical field” or not. We propose a method for discriminating the connected components, using different families of features and a combination of classifiers. A comparison between the results obtained with the combination of classifiers and our first approach [10] demonstrates the utility of combining different feature sets for discriminating classes with large intra-class variability.
Domaines
Traitement du texte et du document
Origine : Fichiers produits par l'(les) auteur(s)
Loading...