A Language Modeling Approach to Image Classification

Pierre Tirilly 1, * Vincent Claveau 1 Patrick Gros 1
* Corresponding author
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Due to the recent and fast diffusion of new digital devices (digital cameras, camera cell phones, internet), the number and size of image databases is dramatically increasing. Managing such databases is an important issue, for professional databases (e.g. from photo agencies) as well as for personal collections. Image classification and retrieval are therefore becoming more and more challenging. Discriminant image descriptors and robust classifiers are needed to handle these tasks. Nowadays approaches generally rely on describing images as a set of elementary and independent image patches called visual words, then using a classical classifier such as Support Vector Machines. In this paper, we propose a more precise description of images, called visual sentences, that includes simple spatial information between visual words. We then propose a classification technique based on language modeling. This classifier can exploit the spatial information of the visual sentences. Experiments on two classical datasets show that our classification method clearly outperforms the state-of- the-art SVM classifier.
Document type :
Conference papers
Complete list of metadatas

Contributor : Pierre Tirilly <>
Submitted on : Thursday, April 11, 2013 - 5:59:26 PM
Last modification on : Friday, November 16, 2018 - 1:28:14 AM


  • HAL Id : hal-00812187, version 1


Pierre Tirilly, Vincent Claveau, Patrick Gros. A Language Modeling Approach to Image Classification. Networked and Electronic Media Summit, 2008, Saint-Malo, France. ⟨hal-00812187⟩



Record views