A Unified framework for local visual descriptors evaluation

Olivier Kihl (1), David Picard (1), Philippe-Henri Gosselin (1, 2)
1 MIDI - Multimedia Indexation and Data Integration
ETIS - Equipes Traitement de l'Information et Systèmes
2 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: Local descriptors are the ground layer of feature-based recognition systems for still images and video. We propose a new framework to explain local descriptors. The framework decomposes a descriptor into three levels: primitive extraction, primitive coding and code aggregation. With it, we are able to explain most of the popular descriptors in the literature, such as HOG, HOF and SURF. We propose two new projection methods based on approximation with bases of oscillating functions (sine functions and Legendre polynomials). Using our framework, we are able to extend usual descriptors by changing the code aggregation or adding a new primitive coding method. The experiments are carried out on image (VOC 2007) and video (KTH, Hollywood2 and UCF11) datasets, and achieve performance equal to or better than that reported in the literature.
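The three-level decomposition named in the abstract can be illustrated with a small HOG-like sketch. This is not the authors' implementation: the function names (`extract_primitives`, `code_primitives`, `aggregate_codes`, `describe`), the central-difference gradient, the hard orientation binning and the 2 x 2 cell grid are all assumptions chosen only to show how the three levels could fit together.

```python
import math


def extract_primitives(image):
    """Level 1 (primitive extraction): central-difference gradient at interior pixels."""
    h, w = len(image), len(image[0])
    prims = []
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = (image[y][x + 1] - image[y][x - 1]) / 2.0
            gy = (image[y + 1][x] - image[y - 1][x]) / 2.0
            prims.append((x, y, gx, gy))
    return prims


def code_primitives(prims, n_bins=4):
    """Level 2 (primitive coding): hard-assign each gradient to one
    orientation bin, weighted by its magnitude (an assumed coding)."""
    codes = []
    for x, y, gx, gy in prims:
        mag = math.hypot(gx, gy)
        angle = math.atan2(gy, gx) % (2 * math.pi)
        b = int(angle / (2 * math.pi) * n_bins) % n_bins
        code = [0.0] * n_bins
        code[b] = mag
        codes.append((x, y, code))
    return codes


def aggregate_codes(codes, width, height, cells=2, n_bins=4):
    """Level 3 (code aggregation): sum the codes falling in each cell of a
    cells x cells grid and concatenate the per-cell histograms."""
    desc = [0.0] * (cells * cells * n_bins)
    for x, y, code in codes:
        cx = min(x * cells // width, cells - 1)
        cy = min(y * cells // height, cells - 1)
        base = (cy * cells + cx) * n_bins
        for b, v in enumerate(code):
            desc[base + b] += v
    return desc


def describe(image, cells=2, n_bins=4):
    """Chain the three levels into one descriptor vector."""
    prims = extract_primitives(image)
    codes = code_primitives(prims, n_bins)
    return aggregate_codes(codes, len(image[0]), len(image), cells, n_bins)


# Usage: a 6 x 6 horizontal intensity ramp (hypothetical test patch).
ramp = [[float(x) for x in range(6)] for _ in range(6)]
descriptor = describe(ramp)
```

Because each level is a separate function, swapping `aggregate_codes` for a different aggregation (for example a projection onto a function basis, as the abstract proposes) would leave the other two levels unchanged, which is the kind of extension the framework is meant to support.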

Contributor : Philippe-Henri Gosselin
Submitted on : Wednesday, December 3, 2014 - 11:52:31 AM
Last modification on : Friday, October 4, 2019 - 12:14:02 PM
Long-term archiving on : Saturday, April 15, 2017 - 12:20:36 AM





Olivier Kihl, David Picard, Philippe-Henri Gosselin. A Unified framework for local visual descriptors evaluation. Pattern Recognition, Elsevier, 2015, 48, pp.1170-1180. ⟨10.1016/j.patcog.2014.11.013⟩. ⟨hal-01089310⟩


