Visual Concept Detection and Annotation via Multiple Kernel Learning of multiple models

Yu Zhang; Stéphane Bres; Liming Chen

doi:10.1007/978-3-642-41184-7_59

Communication Dans Un Congrès Année : 2013

Visual Concept Detection and Annotation via Multiple Kernel Learning of multiple models

(1) , (1) , (1)

Yu Zhang

Fonction : Auteur
PersonId : 843135

Extraction de Caractéristiques et Identification

Stéphane Bres

Fonction : Auteur
PersonId : 7718
IdHAL : stephane-bres
IdRef : 074601946

Extraction de Caractéristiques et Identification

Liming Chen

Fonction : Auteur
PersonId : 7562
IdHAL : liming-chen
IdRef : 067400175

Extraction de Caractéristiques et Identification

Résumé

This paper presents a multi-model framework for Visual Concept Detection and Annotation(VCDA) task based on Multiple Kernel Learning(MKL). To extract discriminative visual features and built visual kernels, meanwhile the tags associated with images are used to build the textual kernels. Finally, in order to benefit from both visual models and textual models, fusion is carried out by MKL efficiently embed. Traditionally the term frequencies model is used to capture this useful textual information. However, the shortcoming in the term frequencies model lies that the performance seriously depends on the dictionary construction and the valuable semantic information can not be captured. To solve this problem, we propose one textual feature construction approach based on $WordNet$ distance. The advantages of this approach are three-fold: (1) It is robust, because our feature construction approach does not depend on dictionary construction. (2) It can capture tags semantic information which is hardly described by the term frequencies model. (3) It efficiently fuses visual models and textual models. The experimental results on the ImageCLEF 2011 show that our approach effectively improves the recognition accuracy.

Domaines

Informatique [cs]

Équipe gestionnaire des publications SI LIRIS : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01339303

Soumis le : mercredi 29 juin 2016-15:52:13

Dernière modification le : mercredi 5 juillet 2023-15:28:04

Dates et versions

hal-01339303 , version 1 (29-06-2016)

Identifiants

HAL Id : hal-01339303 , version 1
DOI : 10.1007/978-3-642-41184-7_59

Citer

Yu Zhang, Stéphane Bres, Liming Chen. Visual Concept Detection and Annotation via Multiple Kernel Learning of multiple models. The International Conference on Image Analysis and Processing (ICIAP 2013), Sep 2013, Naples, Italy. pp.581-590, ⟨10.1007/978-3-642-41184-7_59⟩. ⟨hal-01339303⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS LABEXIMU INSA-GROUPE UDL EC_LYON_STRICT

129 Consultations

0 Téléchargements

Visual Concept Detection and Annotation via Multiple Kernel Learning of multiple models

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager