Hybrid Page Layout Analysis via Tab-Stop Detection, 2009 10th International Conference on Document Analysis and Recognition, 2009.
DOI : 10.1109/ICDAR.2009.257
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.308.972
An Overview of the Tesseract OCR Engine, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 2, pp.629-633, 2007.
DOI : 10.1109/ICDAR.2007.4376991
A Threshold Selection Method from Gray-Level Histograms, IEEE Transactions on Systems, Man, and Cybernetics, vol.9, issue.1, pp.62-66, 1979.
DOI : 10.1109/TSMC.1979.4310076
Improved Hybrid Binarization based on Kmeans for Heterogeneous document processing, 2015 9th International Symposium on Image and Signal Processing and Analysis (ISPA), 2015.
DOI : 10.1109/ISPA.2015.7306060
URL : https://hal.archives-ouvertes.fr/hal-01309993
Automatic Text Extraction from Complex Colored Images Using Gamma Correction Method, Journal of Computer Science, vol.10, issue.4, pp.705-715, 2014.
DOI : 10.3844/jcssp.2014.705.715
URL : http://doi.org/10.3844/jcssp.2014.705.715
Least squares quantization in PCM, IEEE Transactions on Information Theory, vol.28, issue.2, pp.129-137, 1982.
DOI : 10.1109/TIT.1982.1056489
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.131.1338
The open standard for parallel programming of heterogeneous systems.
PixLabeler: User Interface for Pixel-Level Labeling of Elements in Document Images, 2009 10th International Conference on Document Analysis and Recognition, pp.446-450, 2009.
DOI : 10.1109/ICDAR.2009.250
Slides from Tesseract tutorial, 2016.
Two complementary techniques for digitized document analysis, Proceedings of the ACM conference on Document processing systems, DOCPROCS '88, pp.169-176, 1988.
DOI : 10.1145/62506.62539
Foundation of Evaluation, Journal of Documentation, vol.1, issue.4, pp.365-373, 1974.
DOI : 10.1016/0020-0271(73)90066-1