Skip to Main content Skip to Navigation
Conference papers

Visual Concept Detection and Annotation via Multiple Kernel Learning of multiple models

Yu Zhang 1 Stéphane Bres 1 Liming Chen 1
1 imagine - Extraction de Caractéristiques et Identification
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : This paper presents a multi-model framework for Visual Concept Detection and Annotation(VCDA) task based on Multiple Kernel Learning(MKL). To extract discriminative visual features and built visual kernels, meanwhile the tags associated with images are used to build the textual kernels. Finally, in order to benefit from both visual models and textual models, fusion is carried out by MKL efficiently embed. Traditionally the term frequencies model is used to capture this useful textual information. However, the shortcoming in the term frequencies model lies that the performance seriously depends on the dictionary construction and the valuable semantic information can not be captured. To solve this problem, we propose one textual feature construction approach based on $WordNet$ distance. The advantages of this approach are three-fold: (1) It is robust, because our feature construction approach does not depend on dictionary construction. (2) It can capture tags semantic information which is hardly described by the term frequencies model. (3) It efficiently fuses visual models and textual models. The experimental results on the ImageCLEF 2011 show that our approach effectively improves the recognition accuracy.
Document type :
Conference papers
Complete list of metadata
Contributor : Équipe Gestionnaire Des Publications Si Liris <>
Submitted on : Wednesday, June 29, 2016 - 3:52:13 PM
Last modification on : Wednesday, July 8, 2020 - 12:43:45 PM

Links full text



Yu Zhang, Stéphane Bres, Liming Chen. Visual Concept Detection and Annotation via Multiple Kernel Learning of multiple models. The International Conference on Image Analysis and Processing (ICIAP 2013), Sep 2013, Naples, Italy. pp.581-590, ⟨10.1007/978-3-642-41184-7_59⟩. ⟨hal-01339303⟩



Record views