HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Book sections

Visual search for objects in a complex visual context: what we wish to see

Abstract : In this work we propose a saliency based psycho-visual weighting of the BoVW for object recognition. This approach is designed to identify objects related to IADL on videos recorded by a wearable camera. These recording give an egocentric point-of-view on the upcoming action. This point- of-view is also characterized by a complex visual scene with several objects on the frame plan. The human visual system functions is a way to process only the relevant data by considering areas of interest. Based on this idea, we propose a new approach by introducing saliency models to discard irrelevant information in the video frames. Therefore we apply a visual saliency model to weight the image signature within the BoVW framework. Visual saliency is well suited for catching spatio-temporal information related to the observer's attention on the video frame. We also proposed an additional geometric saliency cue that models the anticipation phenomenon observed on subjects watching video content from the wearable camera. The findings show that discarding irrelevant features gives better performances when compared to the baseline method which consider the whole set of features in the images.
Complete list of metadata

Cited literature [52 references]  Display  Hide  Download

Contributor : Aurélie Bugeau Connect in order to contact the contributor
Submitted on : Tuesday, May 20, 2014 - 12:02:59 AM
Last modification on : Saturday, March 5, 2022 - 3:18:02 PM
Long-term archiving on: : Monday, April 10, 2017 - 11:58:20 PM


Files produced by the author(s)


  • HAL Id : hal-00993264, version 1



Hugo Boujut, Aurélie Bugeau, Jenny Benois-Pineau. Visual search for objects in a complex visual context: what we wish to see. Evaggelos Spyrou, Dimitris Iakovidis, Phivos Mylonas. Semantic Multimedia Analysis and Processing, CRC Press, p., 2014, Digital Imaging and Computer Vision. ⟨hal-00993264⟩



Record views


Files downloads