OCR Accuracy Improvement Through a PDE-based Approach

Fadoua Drira 1, Frank Le Bourgeois 1, Hubert Emptoz 1
1 imagine - Extraction de Caractéristiques et Identification
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : This paper focuses on improving the accuracy of optical character recognition (OCR) systems by restoring damaged characters through a PDE (Partial Differential Equation)-based approach. This approach, proposed by D. Tschumperlé, is an anisotropic diffusion approach driven by local tensor fields. It has several properties that are relevant to character restoration. For instance, it is well suited to processing oriented patterns, which are a major characteristic of textual documents. It combines edge-enhancing diffusion, which tends to preserve local structures during smoothing, with coherence-enhancing diffusion, which processes oriented structures by smoothing along the flow direction. Furthermore, unlike existing state-of-the-art methods, this tensor diffusion-based approach requires neither segmentation nor training steps. Experiments on degraded document images illustrate the performance of this PDE-based approach in improving both the visual quality and the OCR accuracy rates.
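The abstract gives no equations, so the following is only a minimal sketch of what one tensor-driven diffusion step could look like, not the authors' implementation. It assumes a trace-based update of the form dI/dt = trace(T H), where H is the image Hessian and T is a 2x2 diffusion tensor built from the local gradient. The weights `f_plus` and `f_minus`, the unsmoothed (rank-1) structure tensor, the time step `dt`, and the function names are all illustrative assumptions. Intensities are assumed to be scaled to roughly [0, 1].

```python
import math

def diffusion_tensor(gx, gy):
    """Return (t11, t12, t22) of a 2x2 diffusion tensor for one gradient (gx, gy).

    Assumption: the structure tensor is the unsmoothed outer product of the
    gradient, so its eigenvalues are |grad|^2 and 0.
    """
    s = gx * gx + gy * gy                 # sum of the two eigenvalues
    if s == 0.0:
        return 1.0, 0.0, 1.0              # flat region: isotropic smoothing
    f_plus = 1.0 / (1.0 + s)              # weight across the edge (assumed form)
    f_minus = 1.0 / math.sqrt(1.0 + s)    # weight along the edge (assumed form)
    norm = math.sqrt(s)
    ux, uy = gx / norm, gy / norm         # unit vector across the edge
    vx, vy = -uy, ux                      # unit vector along the edge
    t11 = f_plus * ux * ux + f_minus * vx * vx
    t12 = f_plus * ux * uy + f_minus * vx * vy
    t22 = f_plus * uy * uy + f_minus * vy * vy
    return t11, t12, t22

def diffusion_step(img, dt=0.1):
    """Apply one explicit step of I <- I + dt * trace(T H) to a 2D list of floats."""
    h, w = len(img), len(img[0])

    def px(y, x):
        # Replicate border pixels so finite differences are defined everywhere.
        return img[min(max(y, 0), h - 1)][min(max(x, 0), w - 1)]

    out = [row[:] for row in img]
    for y in range(h):
        for x in range(w):
            # Central differences for the gradient and the Hessian entries.
            gx = (px(y, x + 1) - px(y, x - 1)) / 2.0
            gy = (px(y + 1, x) - px(y - 1, x)) / 2.0
            ixx = px(y, x + 1) - 2.0 * px(y, x) + px(y, x - 1)
            iyy = px(y + 1, x) - 2.0 * px(y, x) + px(y - 1, x)
            ixy = (px(y + 1, x + 1) - px(y + 1, x - 1)
                   - px(y - 1, x + 1) + px(y - 1, x - 1)) / 4.0
            t11, t12, t22 = diffusion_tensor(gx, gy)
            out[y][x] = img[y][x] + dt * (t11 * ixx + 2.0 * t12 * ixy + t22 * iyy)
    return out
```

In this sketch, smoothing is isotropic where the gradient vanishes and is weighted toward the edge direction where the gradient is strong, because `f_minus` decays more slowly than `f_plus`. A fuller treatment would smooth the structure tensor before computing its eigenvectors, which is what lets the diffusion follow coherent stroke directions.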
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-01593324
Contributor : Équipe Gestionnaire Des Publications Si Liris
Submitted on : Tuesday, September 26, 2017 - 10:07:24 AM
Last modification on : Wednesday, October 31, 2018 - 12:24:25 PM

Citation

Fadoua Drira, Frank Le Bourgeois, Hubert Emptoz. OCR Accuracy Improvement Through a PDE-based Approach. 9th International Conference on Document Analysis and Recognition, ICDAR 2007, Sep 2007, Parana, Brazil. pp.1068-1072, ⟨10.1109/ICDAR.2007.4377079⟩. ⟨hal-01593324⟩
