J. Ah-pine, A. Ah-pine, J. Cifarelli, C. Clinchant, S. Csurka et al., Crossing textual and visual content in different application scenarios, Proceedings, pp.31-56, 2008.
DOI : 10.1007/s11042-008-0246-8

URL : https://hal.archives-ouvertes.fr/hal-01504484

J. Ah-pine, S. Clinchant, G. Csurka, Y. Liu, G. Crete et al., XRCE's participation to ImageCLEF Evaluation of diversity-focused strategies for multimedia retrieval, Evaluating Systems for Multilingual and Multimodal Information Access, 2009.

J. Ah-pine, S. Clinchant, and G. Csurka, Comparison of Several Combinations of Multimodal and Diversity Seeking Methods for Multimedia Retrieval, 2010.
DOI : 10.1007/978-3-642-15751-6_13

URL : https://hal.archives-ouvertes.fr/hal-01504499

K. Barnard, P. Duygulu, D. Forsyth, D. Freitas, and M. Jordan, Matching words and pictures, J of Machine Learning Research, vol.3, 2003.

D. Blei, . Michael, M. Jordan, . Acm, F. Boudin et al., Modeling annotated data A scalable MMR approach to sentence scoring for multi-document update summarization, 2003.

J. Carbonell and J. Goldstein, The use of MMR, diversity-based reranking for reordering documents and producing summaries, Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '98, 1998.
DOI : 10.1145/290941.291025

P. Carbonetto, N. De-freitas, and K. Barnard, A Statistical Model for General Contextual Object Recognition, 2004.
DOI : 10.1007/978-3-540-24670-1_27

S. Clinchant, J. Renders, and G. Csurka, Xrce's participation to imageclef, Working Notes of the 2007 CLEF Workshop, 2007.

S. Clinchant, J. Renders, and G. Csurka, Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval, Advances in Multilingual and Multimodal Information Retrieval, 2008.
DOI : 10.1007/978-3-540-85760-0_71

G. Csurka, C. Dance, L. Fan, J. Willamowski, C. Bray et al., Visual categorization with bags of keypoints Object recognition as machine translation :learning a lexicon for a fixed image vocabulary, ECCV Workshop on Statistical Learning for Computer Vision hello! my name is... buffy " ? automatic naming of characters in TV video, 2002.

S. Feng, V. Lavrenko, and R. Manmatha, Multiple Bernoulli relevance models for image and video annotation, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., 2004.
DOI : 10.1109/CVPR.2004.1315274

URL : http://ciir.cs.umass.edu/pubfiles/mm-333.pdf

T. Huang, C. Dagli, S. Rajaram, E. Chang, M. Mandel et al., Active Learning for Interactive Multimedia Retrieval, Proceedings of the IEEE, vol.96, issue.4, 2008.
DOI : 10.1109/JPROC.2008.916364

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.476.1448

G. Iyengar, P. Duygulu, S. Feng, P. Ircing, S. Khudanpur et al., Joint visual-text modeling for automatic retrieval of multimedia documents, Proceedings of the 13th annual ACM international conference on Multimedia , MULTIMEDIA '05, 2005.
DOI : 10.1145/1101149.1101154

T. Jaakkola and D. Haussler, Exploiting generative models in discriminative classifiers, Advances in Neural Information Processing Systems 11, 1999.

J. Jeon, V. Lavrenko, and R. Manmatha, Automatic image annotation and retrieval using crossmedia relevance models, 2003.
DOI : 10.1145/860435.860459

URL : http://ciir.cs.umass.edu/pubfiles/mm-41.pdf

V. Lavrenko, R. Manmatha, J. Jeon, V. Lavrenko, S. Feng et al., A model for learning the semantics of pictures Models for automatic video annotation and retrieval, 2003.

J. Li and J. Wang, Automatic linguistic indexing of pictures by a statistical modeling approach, PAMI, vol.25, p.9, 2003.

Z. Lin, T. Chua, M. Kan, W. Lee, L. Qiu et al., NUS at DUC 2007: Using evolutionary models of text, Document Understanding Conference, 2005.

N. Maillot, J. Chevallet, V. Valea, and J. Lim, Ipal inter-media pseudo-relevance feedback approach to imageclef 2006 photo retrieval, Working Notes, 2006.
DOI : 10.1007/978-3-540-74999-8_92

URL : https://hal.archives-ouvertes.fr/hal-00954108

J. Marcotorchino and P. Michaud, Heuristic approach of the similarity aggregation problem, Methods of operation research, vol.43, pp.395-404, 1981.

F. Monay and D. Gatica-perez, PLSA-based image auto-annotation, Proceedings of the 12th annual ACM international conference on Multimedia , MULTIMEDIA '04, 2004.
DOI : 10.1145/1027527.1027608

Y. Mori, H. Takahashi, and R. Oka, Image-to-word transformation based on dividing and vector quantizing images with words, MISRM'99 First International Workshop on Multimedia Intelligent Storage and Retrieval Management, 1999.

J. Pan, H. Yang, C. Faloutsos, and P. Duygulu, Gcap: Graph-based automatic image captioning In: CVPR Workshop on Multimedia Data and Document Engineering Perronnin F (2010) Large-scale image retrieval with compressed fisher vectors Fisher kernels on visual vocabularies for image categorization Active feedback in ad hoc information retrieval Video google: A text retrieval approach to object matching in videos, In: CVPR Perronnin F, Dance C In: SIGIR Sivic JS, 2003.

A. Vinokourov, D. Hardoon, and J. Shawe-taylor, Learning the semantics of multimedia content with application to web image retrieval and classification, Fourth International Symposium on Independent Component Analysis and Blind Source Separation, 2003.

C. Zhai and J. Lafferty, Model-based feedback in the language modeling approach to information retrieval, Proceedings of the tenth international conference on Information and knowledge management , CIKM'01, pp.403-410, 2001.
DOI : 10.1145/502585.502654