C. Beckham and C. Pal, A simple squared-error reformulation for ordinal classification, 2016.

Y. Bengio, A. Courville, and P. Vincent, Representation learning: A review and new perspectives, IEEE transactions, vol.35, issue.8, pp.1798-1828, 2013.

A. Coates, H. Lee, and A. Y. Ng, An analysis of single-layer networks in unsupervised feature learning, Ann Arbor, vol.1001, issue.48109, 2010.

C. Eickhoff, I. Schwall, A. García-seco-de-herrera, and H. Müller, Overview of ImageCLEFcaption 2017-image caption prediction and concept detection for biomedical images. CLEF working notes, 2017.

R. Girshick, J. Donahue, T. Darrell, and J. Malik, Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of the IEEE conference on computer vision and pattern recognition, pp.580-587, 2014.

J. Hosang, R. Benenson, and B. Schiele, How good are detection proposals, really?, 2014.

B. Ionescu, H. Müller, M. Villegas, H. Arenas, G. Boato et al., Overview of ImageCLEF 2017: Information extraction from images, CLEF 2017 Proceedings, vol.10456, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01913658

E. L. Mencía and J. Fürnkranz, Efficient multilabel classification algorithms for largescale problems in the legal domain, Semantic Processing of Legal Texts, pp.192-215, 2010.

S. Ren, K. He, R. Girshick, and J. Sun, Faster r-cnn: Towards real-time object detection with region proposal networks, Advances in neural information processing systems, pp.91-99, 2015.

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh et al., Imagenet large scale visual recognition challenge, International Journal of Computer Vision, vol.115, issue.3, pp.211-252, 2015.

S. Shankar, V. K. Garg, and R. Cipolla, Deep-carving: Discovering visual attributes by carving deep neural nets, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.3403-3412, 2015.

A. Sharif-razavian, H. Azizpour, J. Sullivan, and S. Carlsson, Cnn features off-theshelf: an astounding baseline for recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp.806-813, 2014.

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2014.

J. R. Uijlings, K. E. Van-de-sande, T. Gevers, and A. W. Smeulders, Selective search for object recognition, International journal of computer vision, vol.104, issue.2, pp.154-171, 2013.

J. Yosinski, J. Clune, Y. Bengio, and H. Lipson, How transferable are features in deep neural networks? In: Advances in neural information processing systems, pp.3320-3328, 2014.

Z. H. Zhou, M. L. Zhang, S. J. Huang, and Y. F. Li, Multi-instance multi-label learning, Artificial Intelligence, vol.176, issue.1, pp.2291-2320, 2012.