S. Arlot and M. Lerasle, Why V=5 is enough in V-fold cross-validation, ArXiv eprints, 2012.

S. Arlot and A. Celisse, A survey of cross-validation procedures for model selection, Statistics Surveys, vol.4, issue.0, pp.40-79, 2010.
DOI : 10.1214/09-SS054

URL : https://hal.archives-ouvertes.fr/hal-00407906

R. Babbar, I. Partalas, E. Gaussier, and M. R. Amini, Re-ranking approach to classification in large-scale power-law distributed category systems, Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval, SIGIR '14, p.14, 2014.
DOI : 10.1145/2600428.2609509

URL : https://hal.archives-ouvertes.fr/hal-01118830

A. Bella, C. Ferri, J. Hernández-orallo, and M. J. Ramírez-quintana, Quantification via Probability Estimators, 2010 IEEE International Conference on Data Mining, pp.737-742, 2010.
DOI : 10.1109/ICDM.2010.75

Y. Bengio and Y. Grandvalet, No unbiased estimator of the variance of k-fold crossvalidation, Journal of Machine Learning Research, vol.5, pp.1089-1105, 2004.

A. Blum, A. Kalai, and J. Langford, Beating the hold-out, Proceedings of the twelfth annual conference on Computational learning theory , COLT '99, pp.203-208, 1999.
DOI : 10.1145/307400.307439

O. Chapelle, B. Schölkopf, and A. Zien, Semi-Supervised Learning, 2006.
DOI : 10.7551/mitpress/9780262033589.001.0001

A. Esuli and F. Sebastiani, Optimizing Text Quantifiers for Multivariate Loss Functions, ACM Transactions on Knowledge Discovery from Data, vol.9, issue.4, 2013.
DOI : 10.1145/2700406

R. E. Fan, K. W. Chang, C. J. Hsieh, X. R. Wang, and C. J. Lin, LIBLINEAR: A library for large linear classification, Journal of Machine Learning Research, vol.9, 2008.

G. Forman, Counting Positives Accurately Despite Inaccurate Classification, Machine Learning: ECML 2005, pp.564-575, 2005.
DOI : 10.1007/11564096_55

G. Forman, Quantifying counts and costs via classification, Data Mining and Knowledge Discovery, vol.17, issue.6, pp.164-206, 2008.
DOI : 10.1007/s10618-008-0097-y

R. Kohavi, A study of cross-validation and bootstrap for accuracy estimation and model selection, Proceedings of the 14th International Joint Conference on Artificial Intelligence. IJCAI'95, 1995.

M. Mohri, A. Rostamizadeh, and A. Talwalkar, Foundations of Machine Learning, 2012.

I. Partalas, A. Kosmopoulos, N. Baskiotis, T. Artieres, G. Paliouras et al., Lshtc: A benchmark for largescale text classification, p.8581, 2015.

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion et al., Scikit-learn: Machine learning in Python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905