
Supervised learning and codebook optimization for bag of words models

Mingyuan Jiu 1 Christian Wolf 1 Christophe Garcia 1 Atilla Baskurt 1 
1 imagine - Extraction de Caractéristiques et Identification
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : In this paper, we present a novel approach for supervised codebook learning and optimization for bag of words models. This type of model is frequently used in visual recognition tasks such as object class recognition or human action recognition. An entity is represented as a histogram of codewords, which are traditionally obtained by clustering with unsupervised methods such as k-means or random forests; the histograms are then classified in a supervised way. We propose a new supervised method for joint codebook creation and class learning, which learns the cluster centers of the codebook in a goal-directed way using the class labels of the training set. As a result, the codebook is highly correlated with the recognition problem and therefore more discriminative. We propose two different learning algorithms, one based on error backpropagation and one based on cluster label reassignment. We apply the proposed method to human action recognition from video sequences and evaluate it on the KTH dataset, reporting very promising results. The proposed technique makes it possible either to improve the discriminative power of a codebook learned without supervision, or to keep the discriminative power while decreasing the size of the learned codebook, thus reducing the computational complexity of the nearest neighbor search.
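For orientation, the traditional unsupervised pipeline that the abstract takes as its starting point can be sketched as follows. This is a minimal illustration, not the authors' supervised method: it assumes plain Lloyd's k-means, squared Euclidean distance, and hypothetical function names (`kmeans_codebook`, `assign_codewords`, `bow_histogram`) that do not come from the paper.

```python
import numpy as np

def assign_codewords(features, centers):
    """Nearest neighbor search: index of the closest center per descriptor."""
    # Squared Euclidean distances, shape (n_descriptors, n_codewords).
    d2 = ((features[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def kmeans_codebook(features, k, n_iter=20, seed=0):
    """Unsupervised baseline (assumed: Lloyd's k-means); returns k centers."""
    rng = np.random.default_rng(seed)
    # Initialise the centers with k distinct training descriptors.
    idx = rng.choice(len(features), size=k, replace=False)
    centers = features[idx].astype(float)
    for _ in range(n_iter):
        labels = assign_codewords(features, centers)
        for j in range(k):
            members = features[labels == j]
            if len(members) > 0:  # keep the old center if a cluster is empty
                centers[j] = members.mean(axis=0)
    return centers

def bow_histogram(features, centers):
    """Normalised histogram of codeword occurrences for one entity."""
    labels = assign_codewords(features, centers)
    hist = np.bincount(labels, minlength=len(centers)).astype(float)
    return hist / hist.sum()

# Usage with synthetic descriptors (illustrative data only).
rng = np.random.default_rng(1)
train_descriptors = rng.normal(size=(200, 5))
codebook = kmeans_codebook(train_descriptors, k=8)
entity_histogram = bow_histogram(rng.normal(size=(30, 5)), codebook)
```

In this sketch the nearest neighbor search compares every descriptor with every codeword, so its cost grows linearly with the codebook size, which is why a smaller codebook of equal discriminative power reduces computation.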
Document type : Journal articles
Contributor : Équipe gestionnaire des publications SI LIRIS
Submitted on : Wednesday, August 10, 2016 - 4:17:06 PM
Last modification on : Tuesday, June 1, 2021 - 2:08:09 PM




Mingyuan Jiu, Christian Wolf, Christophe Garcia, Atilla Baskurt. Supervised learning and codebook optimization for bag of words models. Cognitive Computation, Springer, 2012, 4, pp.409-419. ⟨10.1007/s12559-012-9137-4⟩. ⟨hal-01352965⟩


