Hybrid OCR combination approach complemented by a specialized ICR applied on ancient documents - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2005

Hybrid OCR combination approach complemented by a specialized ICR applied on ancient documents

Hubert Cecotti
  • Fonction : Auteur
  • PersonId : 830534
Abdel Belaïd
  • Fonction : Auteur
  • PersonId : 830137

Résumé

In spite of the improvement of Commercial Optical Character Recognition (OCR) during the last years, their ability to process different kinds of documents can also be a default. They cannot produce a perfect recognition for all documents. However they allow producing high result for standard cases. We propose in this paper a model combining several OCRs and a specialized ICR (Intelligent Character Recognition) based on a convolutional neural network to complement them. Instead of just performing several OCRs in parallel and applying a fusing rule of the results, a specialized neural network with an adaptive topology is added to complement the OCRs in function of the OCRs errors. This system has been tested on ancient documents containing old characters and old fonts not used in contemporary documents. The OCRs combination increases the recognition of about 3\% whereas the ICR improves the recognition of rejected characters of more than 5\%.
Fichier non déposé

Dates et versions

inria-00000363 , version 1 (27-09-2005)

Identifiants

  • HAL Id : inria-00000363 , version 1

Citer

Hubert Cecotti, Abdel Belaïd. Hybrid OCR combination approach complemented by a specialized ICR applied on ancient documents. 8th International Conference in Document Analysis and Recognition - ICDAR'05, Aug 2005, Seoul, Korea, pp.1045-1049. ⟨inria-00000363⟩
93 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More