A. Smeaton, P. Over, and W. Kraaij, Evaluation campaigns and TRECVid, Proceedings of the 8th ACM international workshop on Multimedia information retrieval , MIR '06, pp.321-330, 2006.
DOI : 10.1145/1178677.1178722

P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders et al., TRECVID 2015 ? An Overview of the Goals, Tasks, Data, Evaluation Mechanisms, and Metrics In Proceedings of TRECVID 2015, pp.16-18, 2015.

Y. Cheng and S. Chen, Image classification using color, texture and regions, Image and Vision Computing, pp.759-776, 2003.
DOI : 10.1016/S0262-8856(03)00069-6

P. H. Gosselin, M. Cord, and S. Philipp-foliguet, Combining visual dictionary, kernel-based similarity and learning strategy for image category retrieval, Computer Vision and Image Understanding, Special Issue on Similarity Matching in Computer Vision and Multimedia, pp.403-441, 2008.
DOI : 10.1016/j.cviu.2007.09.018

URL : https://hal.archives-ouvertes.fr/hal-00520290

M. Redi and B. Merialdo, Saliency moments for image categorization, Proceedings of the 1st ACM International Conference on Multimedia Retrieval, ICMR '11, 2011.
DOI : 10.1145/1991996.1992035

URL : http://www.eurecom.fr/en/publication/3360/download/mm-publi-3360.pdf

D. Picard and P. H. Gosselin, Efficient image signatures and similarities using tensor products of local descriptors, Computer Vision and Image Understanding, pp.680-687, 2013.
DOI : 10.1016/j.cviu.2013.02.004

URL : https://hal.archives-ouvertes.fr/hal-00799074

P. H. Gosselin, N. Murray, H. Jegou, and F. Perronnin, Revisiting the Fisher vector for fine-grained classification, Pattern Recognition Letters, pp.92-98, 2014.
DOI : 10.1016/j.patrec.2014.06.011

URL : https://hal.archives-ouvertes.fr/hal-01056223

A. Oliva and A. Torralba, Modeling the shape of the scene: A holistic representation of the spatial envelope, International Journal of Computer Vision, vol.42, issue.3, pp.145-175, 2001.
DOI : 10.1023/A:1011139631724

K. E. Van-de-sande, T. Gevers, and C. G. Snoek, Evaluation of color descriptors for object and scene recognition, 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp.1582-1596, 2010.
DOI : 10.1109/CVPR.2008.4587658

A. Benoit, A. Caplier, B. Durette, and J. Herault, Using Human Visual System modeling for bio-inspired low level image processing, Computer Vision and Image Understanding, pp.758-773, 2010.
DOI : 10.1016/j.cviu.2010.01.011

URL : https://hal.archives-ouvertes.fr/hal-00377222

J. Sánchez, F. Perronnin, and T. Mensink, Image Classification with the Fisher Vector: Theory and Practice, International Journal of Computer Vision, vol.73, issue.2, pp.222-245, 2013.
DOI : 10.1007/s11263-006-9794-4

. Ballas, IRIM at TRECVID 2012: Semantic Indexing and Multimedia Instance Search, Proceedings of the TRECVID 2011 workshop, pp.26-28, 2012.

. Safadi, Quaero at TRECVID 2013: Semantic Indexing and Collaborative Annotation, Proceedings of the TRECVID 2013 workshop, pp.20-22, 2013.

. Ballas, IRIM at TRECVID 2013: Semantic Indexing and Multimedia Instance Search, Proceedings of the TRECVID 2013 workshop, pp.20-22, 2013.

N. Ballas, B. Labbé, and H. L. Borgne, Aymen Shabou CEA LIST at TRECVID 2013: Instance Search, Proceedings of the TRECVID 2013 workshop, pp.20-22, 2013.

. Safadi, LIG at TRECVID 2014: Semantic Indexing, Proceedings of the TRECVID 2014 workshop, pp.10-12, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01132455

. Ballas, IRIM at TRECVID 2014: Semantic Indexing and Instance search, Proceedings of the TRECVID 2014 workshop, pp.10-12, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01132491

. Safadi, LIG at TRECVID 2015: Semantic Indexing, Proceedings of the TRECVID 2015 workshop, pp.16-18, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01233401

S. Ayache and G. Quénot, Video Corpus Annotation Using Active Learning, 30th European Conference on Information Retrieval (ECIR'08), 2008.
DOI : 10.1007/978-3-540-78646-7_19

URL : https://hal.archives-ouvertes.fr/hal-01089795

D. Gorisse, IRIM at TRECVID 2010: High Level Feature Extraction and Instance Search, TREC Video Retrieval Evaluation workshop, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00953839

B. Safadi and G. Quénot, Re-ranking by Local Rescoring for Video Indexing and Retrieval, CIKM 2011: 20th ACM Conference on Information and Knowledge Management, 2011.
DOI : 10.1145/2063576.2063895

URL : https://hal.archives-ouvertes.fr/hal-00763624

B. Safadi and G. Quénot, Descriptor Optimization for Multimedia Indexing and Retrieval, Multimedia Tools and Applications, pp.1267-1290, 2015.
DOI : 10.1007/s11042-014-2071-6

URL : https://hal.archives-ouvertes.fr/hal-00953090

S. T. Strat, A. Benoit, and P. Lambert, Retina enhanced SIFT descriptors for video indexing, CBMI 2013, 11th International Workshop on Content-Based Multimedia Indexing, 2013.
DOI : 10.1109/cbmi.2013.6576582

A. Hamadi, G. Quénot, and P. Mulhem, Conceptual feedback for semantic multimedia indexing, 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI), pp.1225-1248, 2015.
DOI : 10.1109/CBMI.2013.6576552

URL : https://hal.archives-ouvertes.fr/hal-00953085

S. T. Strat, A. Benoit, P. Lambert, H. Bredin, and G. Quénot, Hierarchical Late Fusion for Concept Detection in Videos, Fusion in Computer Vision -Understanding Complex Visual Content Advances in Computer Vision and Pattern Recognition, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00732740

Y. Freund and R. E. Schapire, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting, Journal of Computer and System Sciences, vol.55, issue.1, pp.119-139, 1997.
DOI : 10.1006/jcss.1997.1504

C. G. Snoek, M. Worring, J. Geusebroek, D. Koelma, and F. J. Seinstra, On The Surplus Value of Semantic Video Analysis Beyond the Key Frame, 2005 IEEE International Conference on Multimedia and Expo, pp.6-8, 2005.
DOI : 10.1109/ICME.2005.1521441

A. Shabou and H. L. Borgne, Locality-constrained and spatially regularized coding for scene categorization, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp.3618-3625, 2012.
DOI : 10.1109/CVPR.2012.6248107

C. Zhu, C. Bichot, and L. Chen, Color orthogonal local binary patterns combination for image region description, Technical Report, LIRIS UMR5205 CNRS

Y. Jia, E. Helhamer, J. Donahue, S. Karayev, J. Long et al., Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, p.14, 2014.
DOI : 10.1145/2647868.2654889

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Advances in Neural Information Processing Systems (NIPS), pp.1097-105, 2012.

K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman, Return of the Devil in the Details: Delving Deep into Convolutional Nets, Proceedings of the British Machine Vision Conference 2014, p.2014
DOI : 10.5244/C.28.6

K. Simonyan and A. Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition

C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed et al., Going Deeper with Convolutions arXiv, pp.1409-4842

A. Mikulik, M. Perdoch, O. Chum, and J. Matas, Learning Vocabularies over a Fine Quantization, International Journal of Computer Vision, vol.30, issue.2, pp.163-175, 2013.
DOI : 10.1109/TPAMI.2007.70755

T. Strat, A. Benoit, and P. , Lambert Retina enhanced bag of words descriptors for video classification Eusipco, 2014.

S. T. Strat, A. Benoit, and P. Lambert, Bags of Trajectory Words for video indexing, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI), 2014.
DOI : 10.1109/CBMI.2014.6849820

URL : https://hal.archives-ouvertes.fr/hal-01096109