L. Alexis, Available online: http://gallica.bnf.fr/ark, 2017.

A. Shahab, F. Shafait, T. Kieninger, and A. Dengel, An open approach towards the benchmarking of table structure recognition systems, Proceedings of the 8th IAPR International Workshop on Document Analysis Systems, DAS '10, pp.9-11, 2010.
DOI : 10.1145/1815330.1815345

G. Lazzara, R. Levillain, T. Géraud, Y. Jacquelet, and J. Marquegnies, Crépin-Leblond, A. The SCRIBO Module of the Olena Platform: a Free Software Framework for Document Image Analysis, Proceedings of the 2011 International Conference on Document Analysis and Recognition (ICDAR), pp.18-21, 2011.

I. Yalniz and R. Manmatha, A Fast Alignment Scheme for Automatic OCR Evaluation of Books, 2011 International Conference on Document Analysis and Recognition, pp.18-21, 2011.
DOI : 10.1109/ICDAR.2011.157

P. Roy, J. Ramel, and N. Ragot, Word Retrieval in Historical Document Using Character-Primitives, 2011 International Conference on Document Analysis and Recognition, pp.18-21, 2011.
DOI : 10.1109/ICDAR.2011.142

URL : https://hal.archives-ouvertes.fr/hal-01026452

. Iam-handwriting-database, Available online: http://www.iam.unibe.ch/fki/databases/iam-handwritingdatabase (accessed on 9, 2017.

E. Grosicki, M. Carré, J. M. Brodin, and E. Geoffrois, Results of the second RIMES evaluation campaign for handwritten mail processing, Proceedings of the 2009 10th International Conference on Document Analysis and Recognition (ICDAR), pp.26-29, 2009.

K. Nakagawa, A. Fujiyoshi, and M. Suzuki, Ground-truthed dataset of chemical structure images in Japanese published patent applications, Proceedings of the 8th IAPR International Workshop on Document Analysis Systems, DAS '10, pp.9-11, 2010.
DOI : 10.1145/1815330.1815389

. Eurecom, Available online: http://www.eurecom.fr/huet/work.html (accessed on 9, 2017.

S. University-of-california and . Francisco, The Legacy Tobacco Document Library (LTDL); University of California, 2007.

M. Delalandre, E. Valveny, T. Pridmore, and D. Karatzas, Generation of synthetic documents for performance evaluation of symbol recognition & spotting systems, International Journal on Document Analysis and Recognition (IJDAR), vol.12, issue.2, pp.187-207, 2010.
DOI : 10.1007/s10032-009-0083-y

URL : https://hal.archives-ouvertes.fr/hal-01022594

A. Fornés, A. Dutta, A. Gordo, J. Lladós, and . Cvc-muscima, CVC-MUSCIMA: a ground truth of handwritten music score images for writer identification and staff removal, International Journal on Document Analysis and Recognition (IJDAR), vol.2, issue.7, pp.243-251, 2012.
DOI : 10.1007/s10032-009-0100-1

N. Nayef, M. M. Luqman, S. Prum, S. Eskenazi, J. Chazalon et al., SmartDoc-QA: A dataset for quality assessment of smartphone captured document images - single and multiple distortions, 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp.23-26, 2015.
DOI : 10.1109/ICDAR.2015.7333960

URL : https://hal.archives-ouvertes.fr/hal-01319900

R. Tc11-online, Available online: http://tc11.cvc.uab.es/datasets, 2017.

B. Yanikoglu and L. Vincent, PINK PANTHER: A COMPLETE ENVIRONMENT FOR GROUND-TRUTHING AND BENCHMARKING DOCUMENT PAGE SEGMENTATION, Pattern Recognition, vol.31, issue.9, pp.31-1191, 1998.
DOI : 10.1016/S0031-3203(97)00137-4

H. Lee, C. Kanungo, and T. , The architecture of TrueViz: a groundTRUth/metadata editing and VIsualiZing ToolKit, Pattern Recognition, vol.36, issue.3, pp.811-825, 2003.
DOI : 10.1016/S0031-3203(02)00101-2

D. Doermann, E. Zotkina, and H. Li, GEDI?A Groundtruthing Environment for Document Images, Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp.9-11, 2010.

C. Clausner, S. Pletschacher, and A. Antonacopoulos, Efficient OCR Training Data Generation with Aletheia, Proceedings of the International Association for Pattern Recognition (IAPR), pp.7-10, 2014.

A. Garz, M. Seuret, F. Simistira, A. Fischer, and R. Ingold, Creating Ground Truth for Historical Manuscripts with Document Graphs and Scribbling Interaction, 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp.11-14, 2016.
DOI : 10.1109/DAS.2016.29

B. Gatos, G. Louloudis, T. Causer, K. Grint, V. Romero et al., Ground-Truth Production in the Transcriptorium Project, 2014 11th IAPR International Workshop on Document Analysis Systems, pp.7-10, 2014.
DOI : 10.1109/DAS.2014.23

H. Wei, K. Chen, M. Seuret, M. Würsch, M. Liwicki et al., DIVADIAWI? A Web-Based Interface for Semi-Automatic Labeling of Historical Document Images, 2015.

J. Mas, A. Fornés, and J. Lladós, An Interactive Transcription System of Census Records Using Word-Spotting Based Information Transfer, 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp.11-14, 2016.
DOI : 10.1109/DAS.2016.47

M. Recital and . Platform, Available online: http://recital.univ-nantes.fr, 2017.

C. Clausner, S. Pletschacher, and A. Antonacopoulos, Aletheia - An Advanced Document Layout and Text Ground-Truthing System for Production Environments, 2011 International Conference on Document Analysis and Recognition, pp.18-21, 2011.
DOI : 10.1109/ICDAR.2011.19

H. S. Baird, Document Image Defect Models, Proceedings of the IAPR workshop on Syntatic and Structural Pattern Recognition, pp.13-15, 1990.
DOI : 10.1007/978-3-642-77281-8_26

Z. Jiuzhou, Creation of Synthetic Chart Image Database with Ground Truth, 2005.

E. Ishidera and D. Nishiwaki, A study on top-down word image generation for handwritten word recognition, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings., pp.3-6, 2003.
DOI : 10.1109/ICDAR.2003.1227842

F. Yin, Q. F. Wang, and C. L. Liu, Transcript Mapping for Handwritten Chinese Documents by Integrating Character Recognition Model and Geometric Context. Pattern Recognit, pp.2807-2818, 2013.
DOI : 10.1016/j.patcog.2013.03.013

M. Opitz, M. Diem, S. Fiel, and F. Kleber, Sablatnig, R. End-to-End Text Recognition Using Local Ternary Patterns, MSER and Deep Convolutional Nets, Proceedings of the 2014 11th IAPR International Workshop on Document Analysis Systems (DAS), pp.7-10, 2014.

S. Yacoub, V. Saxena, and S. Sami, PerfectDoc: a ground truthing environment for complex documents, Eighth International Conference on Document Analysis and Recognition (ICDAR'05), pp.452-456, 2005.
DOI : 10.1109/ICDAR.2005.187

E. Saund, J. Lin, and P. Sarkar, PixLabeler: User Interface for Pixel-Level Labeling of Elements in Document Images, 2009 10th International Conference on Document Analysis and Recognition, pp.26-29, 2009.
DOI : 10.1109/ICDAR.2009.250

B. Lamiroy and D. Lopresti, An Open Architecture for End-to-End Document Analysis Benchmarking, 2011 International Conference on Document Analysis and Recognition, pp.18-21, 2011.
DOI : 10.1109/ICDAR.2011.18

URL : https://hal.archives-ouvertes.fr/inria-00598907

M. Seuret, K. Chen, N. Eichenbergery, M. Liwicki, and R. Ingold, Gradient-domain degradations for improving historical documents images layout analysis, 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp.23-26, 2015.
DOI : 10.1109/ICDAR.2015.7333913

M. Mehri, P. Gomez-krämer, P. Héroux, and R. Mullot, Old document image segmentation using the autocorrelation function and multiresolution analysis, Document Recognition and Retrieval XX, pp.3-7, 2013.
DOI : 10.1117/12.2002365

URL : https://hal.archives-ouvertes.fr/hal-00787779

M. Visani, V. Kieu, A. Fornés, and N. Journet, ICDAR 2013 Music Scores Competition: Staff Removal, 2013 12th International Conference on Document Analysis and Recognition, pp.25-28, 2013.
DOI : 10.1109/ICDAR.2013.284

URL : https://hal.archives-ouvertes.fr/hal-01006096

I. D. Montagner, R. Hirata, . Jr, and N. S. Hirata, A Machine Learning Based Method for Staff Removal, 2014 22nd International Conference on Pattern Recognition, pp.24-28, 2014.
DOI : 10.1109/ICPR.2014.545

A. Fischer, M. Visani, V. C. Kieu, and C. Suen, Generation of learning samples for historical handwriting recognition using image degradation, Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing, HIP '13, pp.24-73, 2013.
DOI : 10.1145/2501115.2501123

URL : https://hal.archives-ouvertes.fr/hal-01006088

R. Smith, An Overview of the Tesseract OCR Engine, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 2, pp.23-26, 2007.
DOI : 10.1109/ICDAR.2007.4376991

M. K. Bahaghighat and J. Mohammadi, Novel approach for baseline detection and Text line segmentation, Int. J. Comput. Appl, vol.20125120, issue.51, pp.108013-1039

A. Telea, An Image Inpainting Technique Based on the Fast Marching Method, Journal of Graphics Tools, vol.93, issue.4, pp.23-34, 2004.
DOI : 10.1073/pnas.93.4.1591

M. Mehri, P. Héroux, J. Lerouge, P. Gomez-krämer, and R. Mullot, A structural signature based on texture for digitized historical book page categorization, 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp.23-26, 2015.
DOI : 10.1109/ICDAR.2015.7333737

URL : https://hal.archives-ouvertes.fr/hal-01237209

T. M. Breuel, Two geometric algorithms for layout analysis In International Workshop on Document Analysis Systems, pp.188-199, 2002.

J. Y. Ramel, S. Leriche, M. Demonet, and S. Busson, User-driven page layout analysis of historical printed books, International Journal of Document Analysis and Recognition (IJDAR), vol.26, issue.6, pp.243-261, 2007.
DOI : 10.1006/cviu.1998.0684

URL : https://hal.archives-ouvertes.fr/hal-00150167

A. Garz, M. Seuret, A. Fischer, and R. Ingold, A User-Centered Segmentation Method for Complex Historical Manuscripts Based on Document Graphs, IEEE Transactions on Human-Machine Systems, vol.47, issue.2, pp.181-193, 2017.
DOI : 10.1109/THMS.2016.2634920

T. Kanungo and R. Haralick, Automatic generation of character groundtruth for scanned documents: a closed-loop approach, Proceedings of 13th International Conference on Pattern Recognition, pp.25-29, 1996.
DOI : 10.1109/ICPR.1996.547030

G. Shakhnarovich, Learning Task-Specific Similarity, 2005.

R. F. Moghaddam and M. Cheriet, Low quality document image modeling and enhancement, International Journal of Document Analysis and Recognition (IJDAR), vol.21, issue.3, pp.183-201, 2009.
DOI : 10.1111/j.1365-2389.2007.00986.x

L. Lelégard, M. Bredif, B. Vallet, and D. Boldo, Motion blur detection in aerial images shot with channel-dependent exposure time, Proceedings of the ISPRS-Technical-Commission III Symposium on Photogrammetric Computer Vision and Image Analysis (PCV), pp.1-3, 2010.

V. Kieu, N. Journet, M. Visani, R. Mullot, and J. Domenger, Semi-synthetic Document Image Generation Using Texture Mapping on Scanned 3D Document Shapes, 2013 12th International Conference on Document Analysis and Recognition, pp.25-28, 2013.
DOI : 10.1109/ICDAR.2013.104

URL : https://hal.archives-ouvertes.fr/hal-01006100

T. Kanungo, R. M. Haralick, and I. Phillips, Global and local document degradation models, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93), pp.20-22, 1993.
DOI : 10.1109/ICDAR.1993.395633

URL : http://www.cfar.umd.edu/~kanungo/pubs/icdar93.ps.Z

J. Calvo-zaragoza, L. Micó, and J. Oncina, Music staff removal with supervised pixel classification, International Journal on Document Analysis and Recognition (IJDAR), vol.31, issue.6, pp.1-9
DOI : 10.1109/ICDAR.2013.284

H. N. Bui, I. S. Na, and S. H. Kim, Staff Line Removal Using Line Adjacency Graph and Staff Line Skeleton for Camera-Based Printed Music Scores, 2014 22nd International Conference on Pattern Recognition, pp.24-28, 2014.
DOI : 10.1109/ICPR.2014.480

T. Géraud, A morphological method for music score staff removal, 2014 IEEE International Conference on Image Processing (ICIP), pp.27-30, 2014.
DOI : 10.1109/ICIP.2014.7025526

J. C. Zaragoza, Pattern Recognition for Music Notation, 2016.

I. S. Montagner, N. S. Hirata, R. Hirata, . Jr, and S. Canu, Kernel Approximations for W-Operator Learning, 2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), pp.4-7, 2016.
DOI : 10.1109/SIBGRAPI.2016.060

I. Database, Available online: https://diuf.unifr.ch/main/hisdoc/iam-histdb (accessed on 9, 2017.

H. Wei, M. Baechler, F. Slimane, and R. Ingold, Evaluation of SVM, MLP and GMM Classifiers for Layout Analysis of Historical Documents, 2013 12th International Conference on Document Analysis and Recognition, pp.25-28, 2013.
DOI : 10.1109/ICDAR.2013.247

T. Varga and H. Bunke, Effects of training set expansion in handwriting recognition using synthetic data, Proceedings of the 11th Conference of the International Graphonomics Society, pp.2-5, 2003.

V. Rabeux, N. Journet, A. Vialard, and J. P. Domenger, Quality evaluation of degraded document images for binarization result prediction, International Journal on Document Analysis and Recognition (IJDAR), vol.32, issue.1, pp.1-13
DOI : 10.2307/2529336

URL : https://hal.archives-ouvertes.fr/hal-00862234

T. K. Bhowmik, T. Paquet, and N. Ragot, OCR Performance Prediction Using a Bag of Allographs and Support Vector Regression, 2014 11th IAPR International Workshop on Document Analysis Systems, pp.7-10, 2014.
DOI : 10.1109/DAS.2014.72

URL : https://hal.archives-ouvertes.fr/hal-01085002

X. Peng, H. Cao, and P. Natarajan, Document image OCR accuracy prediction via latent Dirichlet allocation, 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp.23-26, 2015.
DOI : 10.1109/ICDAR.2015.7333866

P. Ye and D. Doermann, Document Image Quality Assessment: A Brief Survey, 2013 12th International Conference on Document Analysis and Recognition, pp.25-28, 2013.
DOI : 10.1109/ICDAR.2013.148

URL : http://lampsrv02.umiacs.umd.edu/pubs/Papers/pengye-13a/pengye-13a.pdf

C. Clausner, S. Pletschacher, and A. Antonacopoulos, Quality Prediction System for Large-Scale Digitisation Workflows, 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp.11-14, 2016.
DOI : 10.1109/DAS.2016.82