Conference papers

A comparison study between MLP and Convolutional Neural Network models for character recognition

Abstract : Optical Character Recognition (OCR) systems are designed to operate on text contained in scanned documents and images. They comprise text detection and character recognition, in which characters are first described and then classified. In the classification step, a classifier identifies characters according to their feature or template descriptions. In this context, we have proposed the unified character descriptor (UCD) to represent characters based on their features, with classification performed by matching. This recognition scheme achieves good OCR accuracy on homogeneous scanned documents; however, it cannot discriminate characters with high font variation and distortion. To improve recognition, classifiers based on neural networks can be used. The multilayer perceptron (MLP) achieves high recognition accuracy when trained robustly. Moreover, the convolutional neural network (CNN) is currently gaining popularity for its high performance. However, both CNN and MLP may suffer from the large amount of computation required in the training phase. In this paper, we compare MLP and CNN. We provide the MLP with the UCD descriptor and an appropriate network configuration. For the CNN, we employ LeNet-5, the convolutional network designed for handwritten and machine-printed character recognition, and we adapt it to support 62 classes, including both digits and characters. In addition, GPU parallelization is studied to speed up both the MLP and CNN classifiers. Based on our experiments, we demonstrate that the real-time CNN used is 2x more relevant than the MLP when classifying characters.
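
As an illustration of the LeNet-5 adaptation mentioned in the abstract, the sketch below traces layer output shapes and trainable-parameter counts for a simplified LeNet-5 whose output layer is widened to 62 classes. It is not the authors' implementation. It assumes the textbook layout (32x32 grayscale input, 5x5 convolutions, 2x2 pooling without trainable parameters, full connectivity between S2 and C3) and assumes the 62 classes are the 10 digits plus 26 uppercase and 26 lowercase letters. All function names are hypothetical.

```python
import string

# Assumption: 62 classes = 10 digits + 26 uppercase + 26 lowercase letters.
NUM_CLASSES = len(string.digits + string.ascii_letters)


def conv(shape, out_channels, kernel):
    """Valid (unpadded) convolution, stride 1: returns (new shape, parameter count)."""
    channels, height, width = shape
    params = out_channels * (channels * kernel * kernel + 1)  # weights + one bias per filter
    return (out_channels, height - kernel + 1, width - kernel + 1), params


def pool(shape, size):
    """Non-overlapping pooling with no trainable parameters (a simplification)."""
    channels, height, width = shape
    return (channels, height // size, width // size), 0


def dense(shape, units):
    """Fully connected layer applied to the flattened input."""
    channels, height, width = shape
    inputs = channels * height * width
    return (units, 1, 1), units * (inputs + 1)  # weights + one bias per unit


def lenet5_summary(num_classes=NUM_CLASSES, input_shape=(1, 32, 32)):
    """Return a list of (layer name, output shape, parameter count)."""
    layers = [
        ("C1", conv, (6, 5)),
        ("S2", pool, (2,)),
        ("C3", conv, (16, 5)),  # assumption: fully connected to all 6 S2 maps
        ("S4", pool, (2,)),
        ("C5", conv, (120, 5)),
        ("F6", dense, (84,)),
        ("OUTPUT", dense, (num_classes,)),  # only this layer changes with the class count
    ]
    shape = input_shape
    rows = []
    for name, layer_fn, args in layers:
        shape, params = layer_fn(shape, *args)
        rows.append((name, shape, params))
    return rows


def total_params(num_classes=NUM_CLASSES):
    """Total trainable parameters of the simplified network."""
    return sum(params for _, _, params in lenet5_summary(num_classes))


if __name__ == "__main__":
    for name, shape, params in lenet5_summary():
        print(f"{name:7s} output={shape} params={params}")
    print("total parameters:", total_params())
```

Under these assumptions, only the output layer changes when moving from 10 to 62 classes; the convolutional feature extractor is unchanged.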

Cited literature [25 references]
Contributor : Rostom Kachouri
Submitted on : Sunday, May 21, 2017 - 3:05:16 PM
Last modification on : Thursday, September 29, 2022 - 2:21:15 PM
Long-term archiving on : Wednesday, August 23, 2017 - 10:51:09 AM


Files produced by the author(s)



Syrine Ben Driss, Mahmoud Soua, Rostom Kachouri, Mohamed Akil. A comparison study between MLP and Convolutional Neural Network models for character recognition. SPIE Conference on Real-Time Image and Video Processing, Apr 2017, Anaheim, CA, United States. ⟨10.1117/12.2262589⟩. ⟨hal-01525504⟩


