M. Boutell, J. Luo, and C. Brown, Scene Parsing Using Region-Based Generative Models, IEEE Transactions on Multimedia, vol.9, issue.1, pp.136-146, 2007.
DOI : 10.1109/TMM.2006.886372

Y. Chang, H. Ann, and W. Yeh, A unique-ID-based matrix strategy for efficient iconic indexing of symbolic pictures, Pattern Recognition, vol.33, issue.8, pp.1263-1276, 2000.
DOI : 10.1016/S0031-3203(99)00115-6

T. Chua, K. Tan, and B. Ooi, Fast signature-based color-spatial image retrieval, pp.362-369, 1997.

R. Datta, D. Joshi, J. Li, and J. Wang, Image retrieval, ACM Computing Surveys, vol.40, issue.2, pp.1-60, 2008.
DOI : 10.1145/1348246.1348248

M. Egenhofer and J. Herring, Categorizing binary topological relationships between regions, lines and points in geographic databases. In: A framework for the definition of topological relationships and an approach to spatial reasoning within this framework, 1991.

P. Felzenszwalb and D. Huttenlocher, Efficient Graph-Based Image Segmentation, International Journal of Computer Vision, vol.59, issue.2, pp.167-181, 2004.
DOI : 10.1023/B:VISI.0000022288.19776.77

S. Gao, D. Wang, and C. Lee, Automatic image annotation through multi-topic text categorization, Proc. of ICASSP, pp.377-380, 2006.

D. Han, W. Li, and Z. Li, Semantic image classification using statistical local spatial relations model, Multimedia Tools and Applications, vol.28, issue.8, pp.169-188, 2008.
DOI : 10.1007/s11042-008-0203-6

Y. Hironobu, H. Takahashi, and R. Oka, Image-to-word transformation based on dividing and vector quantizing images with words, Neural networks, pp.405-409, 1999.

J. Jeon, V. Lavrenko, and R. Manmatha, Automatic image annotation and retrieval using cross-media relevance models, SIGIR '03, pp.119-126, 2003.

J. Li and J. Wang, Automatic linguistic indexing of pictures by a statistical modeling approach, IEEE PAMI, vol.25, issue.9, pp.1075-1088, 2003.

J. Lim, Y. Li, Y. You, and J. Chevallet, Scene Recognition with Camera Phones for Tourist Information Access, Multimedia and Expo, 2007 IEEE International Conference on, p.7, 2007.
DOI : 10.1109/ICME.2007.4284596

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

L. Maisonnasse, E. Gaussier, and J. Chevallet, Revisiting the dependence language model for information retrieval, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '07, p.7, 2007.
DOI : 10.1145/1277741.1277863
URL : https://hal.archives-ouvertes.fr/hal-00953986

L. Maisonnasse, E. Gaussier, and J. Chevallet, Model Fusion in Conceptual Language Modeling, ECIR '09, pp.240-251, 2009.
DOI : 10.1007/978-3-540-85760-0_59
URL : https://hal.archives-ouvertes.fr/hal-00953849

C. Manning, P. Raghavan, and H. Schütze, Language models for information retrieval, pp.237-252, 2009.
DOI : 10.1017/CBO9780511809071.013

P. Mulhem and E. Debanne, A framework for mixed symbolic-based and feature-based query by example image retrieval, Int J Inf Technol, vol.12, issue.1, pp.74-98, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00953804

I. Ounis and M. Pasca, RELIEF, Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '98, pp.266-274, 1998.
DOI : 10.1145/290941.291007

G. Papadopoulos, V. Mezaris, I. Kompatsiaris, and M. Strintzis, Combining Global and Local Information for Knowledge-Assisted Image Analysis and Classification, Special Issue on Knowledge-Assisted Media Analysis for Interactive Multimedia Applications, 2007.
DOI : 10.1109/72.788646

T. Pham, L. Maisonnasse, and P. Mulhem, Visual language modeling for mobile localization: Lig participation in Robotvision'09, CLEF working notes, 2009.
URL : https://hal.archives-ouvertes.fr/hal-01010253

T. Pham, L. Maisonnasse, P. Mulhem, and E. Gaussier, Integration of spatial relationships in visual language model for scene retrieval, 2010 International Workshop on Content Based Multimedia Indexing (CBMI), p.8, 2010.
DOI : 10.1109/CBMI.2010.5529894
URL : https://hal.archives-ouvertes.fr/hal-00953832

T. Pham, P. Mulhem, and L. Maisonnasse, Spatial relationships in visual graph modeling for image categorization, Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval, SIGIR '10, 2010.
DOI : 10.1145/1835449.1835587
URL : https://hal.archives-ouvertes.fr/hal-00953836

T. Pham and A. Smeulders, Learning spatial relations in object recognition, Pattern Recognition Letters, vol.27, issue.14, pp.1673-1684, 2006.
DOI : 10.1016/j.patrec.2006.03.016

J. Ponte and W. Croft, A language modeling approach to information retrieval, Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '98, p.98, 1998.
DOI : 10.1145/290941.291008

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.12, pp.1349-1380, 2000.
DOI : 10.1109/34.895972

J. Smith and S. Chang, VisualSEEk, Proceedings of the fourth ACM international conference on Multimedia, MULTIMEDIA '96, pp.87-98, 1996.
DOI : 10.1145/244130.244151

F. Song and W. Croft, A general language model for information retrieval, Proceedings of the eighth international conference on Information and knowledge management, CIKM '99, pp.316-321, 1999.
DOI : 10.1145/319950.320022

P. Tirilly, V. Claveau, and P. Gros, Language modeling for bag-of-visual words image categorization, Proceedings of the 2008 international conference on Content-based image and video retrieval, CIVR '08, pp.249-258, 2008.
DOI : 10.1145/1386352.1386388
URL : https://hal.archives-ouvertes.fr/hal-00811922

C. Won, D. Park, and S. Park, Efficient Use of MPEG-7 Edge Histogram Descriptor, ETRI Journal, vol.24, issue.1, 2002.
DOI : 10.4218/etrij.02.0102.0103

L. Wu, M. Li, Z. Li, W. Ma, and N. Yu, Visual language modeling for image classification, Proceedings of the international workshop on multimedia information retrieval, MIR '07, pp.115-124, 2007.
DOI : 10.1145/1290082.1290101

C. Zhai and J. Lafferty, A study of smoothing methods for language models applied to Ad Hoc information retrieval, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '01, pp.334-342, 2001.
DOI : 10.1145/383952.384019