K. Barnard, P. Duygulu, D. Forsyth, N. De-freitas, D. Blei et al., Matching words and pictures, JMLR, vol.3, pp.1107-1135, 2003.

S. Bucak, P. Mallapragada, R. Jin, and A. Jain, Efficient multi-label ranking for multi-class learning: Application to object recognition, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459460

G. Carneiro, A. Chan, P. Moreno, and N. Vasconcelos, Supervised Learning of Semantic Classes for Image Annotation and Retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.3, pp.394-410, 2007.
DOI : 10.1109/TPAMI.2007.61

C. Cusano, G. Ciocca, and R. Schettini, Image annotation using SVM, Proceedings Internet imaging (SPIE), 2004.
DOI : 10.1117/12.526746

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.107.3235

M. Douze, M. Guillaumin, T. Mensink, C. Schmid, and J. Verbeek, INRIA-LEARs participation to ImageCLEF, Working Notes for the CLEF 2009 Workshop, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00439299

S. Feng, R. Manmatha, and V. Lavrenko, Multiple Bernoulli relevance models for image and video annotation, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., 2004.
DOI : 10.1109/CVPR.2004.1315274

P. Gehler and S. Nowozin, On feature combination for multiclass object classification, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459169

D. Grangier and S. Bengio, A Discriminative Kernel-Based Approach to Rank Images from Text Queries, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.30, issue.8, pp.1371-1384, 2008.
DOI : 10.1109/TPAMI.2007.70791

M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid, TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459266

URL : https://hal.archives-ouvertes.fr/inria-00439276

T. Hertz, A. Bar-hillel, and D. Weinshall, Learning distance functions for image retrieval, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., 2004.
DOI : 10.1109/CVPR.2004.1315215

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.106.5883

M. Huiskes and M. Lew, The MIR flickr retrieval evaluation, Proceeding of the 1st ACM international conference on Multimedia information retrieval, MIR '08, 2008.
DOI : 10.1145/1460096.1460104

H. Jégou, C. Schmid, H. Harzallah, and J. Verbeek, Accurate Image Search Using the Contextual Dissimilarity Measure, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.1, pp.2-11, 2010.
DOI : 10.1109/TPAMI.2008.285

J. Jeon, V. Lavrenko, and R. Manmatha, Automatic image annotation and retrieval using cross-media relevance models, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval , SIGIR '03, 2003.
DOI : 10.1145/860435.860459

URL : http://ciir.cs.umass.edu/pubfiles/mm-41.pdf

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

J. Li and J. Wang, Real-time computerized annotation of pictures, Proceedings of the 14th annual ACM international conference on Multimedia , MULTIMEDIA '06, pp.985-1002, 2008.
DOI : 10.1145/1180639.1180841

J. Liu, M. Li, Q. Liu, H. Lu, and S. Ma, Image annotation via graph learning, Pattern Recognition, vol.42, issue.2, pp.218-228, 2009.
DOI : 10.1016/j.patcog.2008.04.012

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

A. Makadia, V. Pavlovic, and S. Kumar, A New Baseline for Image Annotation, ECCV, 2008.
DOI : 10.1007/978-3-540-88690-7_24

T. Mei, Y. Wang, X. Hua, S. Gong, and S. Li, Coherent image annotation by learning semantic distance, CVPR, 2008.

F. Monay and D. Gatica-perez, PLSA-based image auto-annotation, Proceedings of the 12th annual ACM international conference on Multimedia , MULTIMEDIA '04, 2004.
DOI : 10.1145/1027527.1027608

A. Oliva and A. Torralba, Modeling the shape of the scene: a holistic representation of the spatial envelope, International Journal of Computer Vision, vol.42, issue.3, pp.145-175, 2001.
DOI : 10.1023/A:1011139631724

J. Pan, H. Yang, C. Faloutsos, and P. Duygulu, Automatic multimedia cross-modal correlation discovery, Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '04, 2004.
DOI : 10.1145/1014052.1014135

J. Van-de-weijer and C. Schmid, Coloring Local Feature Extraction, ECCV, 2006.
DOI : 10.1002/col.10049

URL : https://hal.archives-ouvertes.fr/inria-00548576

O. Yakhnenko and V. Honavar, Annotating images and image objects using a hierarchical dirichlet process model, Proceedings of the 9th International Workshop on Multimedia Data Mining held in conjunction with the ACM SIGKDD 2008, MDM '08, 2008.
DOI : 10.1145/1509212.1509213

H. Zhang, A. Berg, M. Maire, and J. Malik, SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2126-2136, 2006.
DOI : 10.1109/CVPR.2006.301

J. Zhang, M. Marsza-lek, S. Lazebnik, and C. Schmid, Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study, International Journal of Computer Vision, vol.36, issue.1, pp.213-238, 2007.
DOI : 10.1007/s11263-006-9794-4

URL : https://hal.archives-ouvertes.fr/inria-00548574