S. Koukoulas and G. A. Blackburn, Introducing new indices for accuracy evaluation of classified images representing semi-natural woodland environments, Photogramm Eng Rem S, vol.67, pp.499-510, 2001.

L. A. Goodman and W. H. , Measures of Association for Cross Classification, J Am Stat Assoc, vol.49, pp.732-64, 1954.

J. Cohen, A Coefficient of Agreement for Nominal Scales, Educational and Psychological Measurement, vol.20, issue.1, pp.37-46, 1960.
DOI : 10.1177/001316446002000104

P. Jaccard, THE DISTRIBUTION OF THE FLORA IN THE ALPINE ZONE.1, New Phytologist, vol.11, issue.2, pp.37-50, 1912.
DOI : 10.1111/j.1469-8137.1912.tb05611.x

M. Sokolova and G. Lapalme, A systematic analysis of performance measures for classification tasks, Information Processing &amp, Management, vol.45, pp.427-437, 2009.

S. V. Stehman, Comparing thematic maps based on map value, International Journal of Remote Sensing, vol.20, issue.12, pp.2347-2366, 1999.
DOI : 10.1080/014311699212065

S. V. Stehman, Selecting and interpreting measures of thematic classification accuracy, Remote Sensing of Environment, vol.62, issue.1, pp.77-89, 1997.
DOI : 10.1016/S0034-4257(97)00083-7

I. Guggenmoos-holzmann, How Reliable Are Chance-Corrected Measures of Agreement, pp.2191-2205, 1993.

V. Labatut and H. Cherifi, Accuracy Measures for the Comparison of Classifiers, International Conference on Information Technology, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00611319

C. X. Ling, J. Huang, and H. Zhang, AUC: a statistically consistent and more discriminating measure than accuracy, 18th International Conference on Artificial Intelligence, 2003.

P. A. Flach, The geometry of ROC space: understanding machine learning metrics through ROC isometrics, Twentieth International Conference on Machine Learning (ICML) Washington DC, 2003.

A. N. Albatineh, M. Niewiadomska-bugaj, and D. , On Similarity Indices and Correction for Chance Agreement, Journal of Classification, vol.23, issue.2, pp.301-313, 2006.
DOI : 10.1007/s00357-006-0017-z

R. Caruana and A. Niculescu, Data mining in metric space, Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '04, 2004.
DOI : 10.1145/1014052.1014063

C. R. Liu, P. Frazier, and L. Kumar, Comparative assessment of the measures of thematic classification accuracy, Remote Sensing of Environment, vol.107, issue.4, pp.606-616, 2007.
DOI : 10.1016/j.rse.2006.10.010

C. Ferri, J. Hernández-orallo, and R. Modroiu, An experimental comparison of performance measures for classification, Pattern Recognition Letters, vol.30, issue.1, pp.27-38, 2009.
DOI : 10.1016/j.patrec.2008.08.010

G. Türk, Gt index: A measure of the success of prediction, Remote Sensing of Environment, vol.8, issue.1, pp.65-75, 1979.
DOI : 10.1016/0034-4257(79)90024-5

J. T. Finn, Use of the average mutual information index in evaluating classification error and consistency, International journal of geographical information systems, vol.52, issue.4, pp.349-366, 1993.
DOI : 10.1037/h0026256

I. H. Witten and E. Frank, Data mining, ACM SIGMOD Record, vol.31, issue.1, 2005.
DOI : 10.1145/507338.507355

P. Villegas, E. Bru, B. Mayayo, L. Carpio, E. Alonso et al., Visual scene classification for image and video home content, 2008 International Workshop on Content-Based Multimedia Indexing, pp.77-84, 2008.
DOI : 10.1109/CBMI.2008.4564931

J. Linnartz, T. Kalker, and G. , Modelling the False Alarm and Missed Detection Rate for Electronic Watermarks, Lecture Notes in Computer Science, vol.1525, pp.329-343, 1998.
DOI : 10.1007/3-540-49380-8_23

T. Sørensen, A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on Danish commons, Biologiske Skrifter / Kongelige Danske Videnskabernes Selskab, vol.5, pp.1-34, 1948.

U. Hellden, A test of landsat-2 imagery and digital data for thematic mapping illustrated by an environmental study in northern Kenya, Nat. Geog. Inst, p.47, 1980.

N. M. Short, The landsat tutorical workbook? Basics of satellite remote sensing, Goddard Space Flight Center, p.1078, 1982.

G. Türk, Chance correction and map evaluation, Remote Sensing of Environment, vol.82, issue.1, pp.123-129, 2002.
DOI : 10.1016/S0034-4257(02)00016-0

S. Kulczynski, Die Pflanzenassociationen der Pienenen, Bulletin International de L'Académie Polonaise des Sciences et des lettres, Classe des sciences mathématiques et naturelles, pp.57-203, 1927.

W. A. Scott, Reliability of Content Analysis: The Case of Nominal Scale Coding, Public Opinion Quarterly, vol.19, issue.3, pp.321-325, 1955.
DOI : 10.1086/266577

A. E. Maxwell, Coefficients of agreement between observers and their interpretation, The British Journal of Psychiatry, vol.130, issue.1, pp.79-83, 1977.
DOI : 10.1192/bjp.130.1.79

L. A. Goodman, The Analysis of Cross-Classified Data: Independence, Quasi-Independence, and Interactions in Contingency Tables with or Without Missing Entries, Journal of the American Statistical Association, vol.63, issue.324, pp.1091-1131, 1968.
DOI : 10.2307/2285873

P. S. Bullen, Handbook of Means and Their Inequalities, p.Kluwer, 2003.
DOI : 10.1007/978-94-017-0399-4

D. V. Cicchetti and A. R. , High agreement but low kappa: II. Resolving the paradoxes, Journal of Clinical Epidemiology, vol.43, issue.6, pp.551-559, 1990.
DOI : 10.1016/0895-4356(90)90159-M

F. E. Zegers and J. M. Berge, A family of association coefficients for metric scales, Psychometrika, vol.2, issue.1, pp.17-24, 1985.
DOI : 10.1007/BF02294144

T. Fung and E. Ledrew, The determination of optimal threshold levels for change detection using various accuracy indices, Photogramm Eng Rem S, vol.54, pp.1449-1454, 1988.