Supervised Learning of Gaussian Mixture Models forVisual Vocabulary Generation - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Pattern Recognition Année : 2012

Supervised Learning of Gaussian Mixture Models forVisual Vocabulary Generation

Elisa Fromont
Damien Muselet
Marc Sebban

Résumé

The creation of semantically relevant clusters is vital in bag-of-visual words models which are known to be very successful to achieve image classification tasks. Generally, unsupervised clustering algorithms, such as K-means, are employed to create such clusters from which visual dictionaries are deduced. K-means achieves a hard assignment by associating each image descriptor to the cluster with the nearest mean. By this way, the within-cluster sum of squares of distances is minimized. A limitation of this approach in the context of image classification is that it usually does not use any supervision that limits the discriminative power of the resulting visual words (typically the centroids of the clusters). More recently, some supervised dictionary creation methods based on both supervised information and data fitting were proposed leading to more discriminative visual words. But, none of them consider the uncertainty present at both image descriptor and cluster levels. In this paper, we propose a supervised learning algorithm based on a Gaussian Mixture model which not only generalizes the K-means algorithm by allowing soft assignments, but also exploits supervised information to improve the discriminative power of the clusters. Technically, our algorithm aims at optimizing, using an EM-based approach, a convex combination of two criteria: the first one is unsupervised and based on the likelihood of the training data; the second is supervised and takes into account the purity of the clusters. We show on two well known datasets that our method is able to create more relevant clusters by comparing its behavior with the state of the art dictionary creation methods.
Fichier principal
Vignette du fichier
PR2011.pdf (511.76 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00617693 , version 1 (30-08-2011)

Identifiants

Citer

Basura Fernando, Elisa Fromont, Damien Muselet, Marc Sebban. Supervised Learning of Gaussian Mixture Models forVisual Vocabulary Generation. Pattern Recognition, 2012, 45 (2), pp.897-907. ⟨10.1016/j.patcog.2011.07.021⟩. ⟨hal-00617693⟩
267 Consultations
1257 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More