STTK-based Video Object Recognition

Shuji Zhao 1 Frédéric Precioso 1 Matthieu Cord 2
1 MIDI - Multimedia Indexation and Data Integration
ETIS - Equipes Traitement de l'Information et Systèmes
2 MALIRE - Machine Learning and Information Retrieval
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : In this paper, we extend our video object recognition system to multiclass object recognition context, dealing with unbalanced data sets and comparing our resuls to state-of-the-art methods. Our approach is based on a Spatio-Temporal data representation, a dedicated kernel design and statistical learning techniques for object recognition. From video tracks made of segmented object regions in the successive frames, we extract sets of spatio-temporally coherent SIFT-based features, called Spatio-Temporal Tubes. To compare these complex tube objects, we integrate a Spatio-Temporal Tube Kernel (STTK) function into a multi-class classification framework with balancing process for unequal classes. Our approach is successfully evaluated on episodes from “Buffy, the Vampire Slayer” TV series which have been used in other works targeting same objectives. Our method proved to be more robust than dictionary based, facial feature based and key-frame based approaches. Our method is also tested on a small car database and preliminary results for car identification task illustrate its generalization potential.
Document type :
Conference papers
Complete list of metadatas
Contributor : Michel Jordan <>
Submitted on : Friday, January 11, 2013 - 2:53:12 PM
Last modification on : Friday, October 4, 2019 - 12:14:02 PM



Shuji Zhao, Frédéric Precioso, Matthieu Cord. STTK-based Video Object Recognition. ICIP 2010 - 17th IEEE International Conference on Image Processing, Sep 2010, Hong-Kong, Hong Kong SAR China. pp.3873-3876, ⟨10.1109/ICIP.2010.5651177⟩. ⟨hal-00773037⟩



Record views