Visual word disambiguation by semantic contexts

Yu Su 1 Frédéric Jurie 1
1 Equipe Image - Laboratoire GREYC - UMR6072
GREYC - Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen
Abstract : This paper presents a novel schema to address the polysemy of visual words in the widely used bag-of-words model. As a visual word may have multiple meanings, we show it is possible to use semantic contexts to disambiguate these meanings and therefore improve the performance of bag-of words model. On one hand, for an image, multiple contextspecific bag-of-words histograms are constructed, each of which corresponds to a semantic context. Then these histograms are merged by selecting only the most discriminative context for each visual word, resulting in a compact image representation. On the other hand, an image is represented by the occurrence probabilities of semantic contexts. Finally, when classifying an image, two image representations are combined at decision level to utilize the complementary information embedded in them. Experiments on three challenging image databases (PASCAL VOC 2007, Scene-15 and MSRCv2) show that our method significantly outperforms state-of-the-art classification methods.
Document type :
Conference papers
Complete list of metadatas

Cited literature [36 references]  Display  Hide  Download
Contributor : Yvain Queau <>
Submitted on : Friday, April 5, 2013 - 7:03:09 PM
Last modification on : Thursday, February 7, 2019 - 5:47:10 PM
Long-term archiving on : Monday, April 3, 2017 - 1:10:23 AM


Files produced by the author(s)



Yu Su, Frédéric Jurie. Visual word disambiguation by semantic contexts. IEEE Intenational Conference on Computer Vision (ICCV), 2011, Spain. pp.311-318, ⟨10.1109/ICCV.2011.6126257⟩. ⟨hal-00808655⟩



Record views


Files downloads