J. Deng, W. Dong, R. Socher, L. J. Li, K. Li et al., ImageNet: A large-scale hierarchical image database, In: CVPR, 2009.

G. Checkik, V. Sharma, U. Shalit, and S. Bengio, Large Scale Online Learning of Image Similarity through Ranking, Journal of Machine Learning Research, vol.11, pp.1109-1135, 2010.
DOI : 10.1007/978-3-642-02172-5_2

J. Deng, A. Berg, K. Li, and L. Fei-fei, What Does Classifying More Than 10,000 Image Categories Tell Us?, 2010.
DOI : 10.1007/978-3-642-15555-0_6
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.173.3680

A. Vedaldi and A. Zisserman, Efficient additive kernels via explicit feature maps, In: CVPR, 2010.
DOI : 10.1109/cvpr.2010.5539949
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.167.7024

M. Rohrbach, M. Stark, and B. Schiele, Evaluating knowledge transfer and zero-shot learning in a large-scale setting, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995627

H. Jégou, M. Douze, and C. Schmid, Product Quantization for Nearest Neighbor Search, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.1, pp.117-128, 2011.
DOI : 10.1109/TPAMI.2010.57

J. Weston, S. Bengio, and N. Usunier, WSABIE: Scaling up to large vocabulary image annotation, In: IJCAI, 2011.

J. Sánchez and F. Perronnin, High-dimensional signature compression for large-scale image classification, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995504

H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez et al., Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9, 2012.
DOI : 10.1109/TPAMI.2011.235

Y. Lin, F. Lv, S. Zhu, M. Yang, T. Cour et al., Large-scale image classification: Fast feature extraction and SVM training, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995477
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.225.3736

F. Perronnin, Z. Akata, Z. Harchaoui, and C. Schmid, Towards good practice in large-scale learning for image classification, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248090
URL : https://hal.archives-ouvertes.fr/hal-00690014

M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid, TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459266
URL : https://hal.archives-ouvertes.fr/inria-00439276

A. R. Webb, Statistical pattern recognition, 2002.
DOI : 10.1002/9781119952954

C. Veenman and D. Tax, LESS: a model-based classifier for sparse subspaces, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.9, pp.1496-1500, 2005.
DOI : 10.1109/TPAMI.2005.182

X. Zhou, X. Zhang, Z. Yan, S. F. Chang, M. Hasegawa-johnson et al., SIFT-Bag kernel for video event analysis, Proceeding of the 16th ACM international conference on Multimedia, MM '08, 2008.
DOI : 10.1145/1459359.1459391
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.139.3496

K. Weinberger and L. Saul, Distance metric learning for large margin nearest neighbor classification, Journal of Machine Learning Research, vol.10, pp.207-244, 2009.

L. Bottou, Large-scale machine learning with stochastic gradient descent, In: COMPSTAT, 2010.
DOI : 10.1007/978-3-7908-2604-3_16
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.419.462

R. Gray and D. Neuhoff, Quantization, IEEE Transactions on Information Theory, vol.44, issue.6, pp.2325-2383, 1998.
DOI : 10.1109/18.720541

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, 2010.
DOI : 10.1007/978-3-642-15561-1_11
URL : https://hal.archives-ouvertes.fr/inria-00548630

J. Zhang, M. Marsza?ek, S. Lazebnik, and C. Schmid, Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study, International Journal of Computer Vision, vol.36, issue.1, pp.73-213, 2007.
DOI : 10.1007/s11263-006-9794-4
URL : https://hal.archives-ouvertes.fr/inria-00548574

E. Nowak and F. Jurie, Learning Visual Similarity Measures for Comparing Never Seen Objects, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.382969
URL : https://hal.archives-ouvertes.fr/hal-00203958

J. Chai, H. Liua, B. Chenb, and Z. Baoa, Large margin nearest local mean classifier, Signal Processing, vol.90, issue.1, pp.236-248, 2010.
DOI : 10.1016/j.sigpro.2009.06.015

L. Fei-fei, R. Fergus, and P. Perona, One-shot learning of object categories, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.4, pp.594-611, 2006.
DOI : 10.1109/TPAMI.2006.79

C. Lampert, H. Nickisch, and S. Harmeling, Learning to detect unseen object classes by between-class attribute transfer, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206594

T. Tommasi and B. Caputo, The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories, Procedings of the British Machine Vision Conference 2009, 2009.
DOI : 10.5244/C.23.80

H. Larochelle, D. Erhan, and Y. Bengio, Zero-data learning of new tasks, AAAI Conference on Artificial Intelligence, 2008.

B. Bai, J. Weston, D. Grangier, R. Collobert, Y. Qi et al., Learning to rank with (a lot of) word features, Information Retrieval, vol.22, issue.1, pp.291-314, 2010.
DOI : 10.1007/s10791-009-9117-9

J. L. Gauvain and C. H. Lee, Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Transactions on Speech and Audio Processing, vol.2, issue.2, pp.291-298, 1994.
DOI : 10.1109/89.279278