Visual search for objects in a complex visual context: what we wish to see

Abstract: In this work we propose a saliency-based psycho-visual weighting of the Bag-of-Visual-Words (BoVW) model for object recognition. The approach is designed to identify objects related to instrumental activities of daily living (IADL) in videos recorded by a wearable camera. These recordings give an egocentric point of view on the upcoming action. This point of view is also characterized by a complex visual scene, with several objects in the frame plane. The human visual system functions in a way that processes only the relevant data, by considering areas of interest. Based on this idea, we propose a new approach that introduces saliency models to discard irrelevant information in the video frames: we apply a visual saliency model to weight the image signature within the BoVW framework. Visual saliency is well suited to capturing spatio-temporal information related to the observer's attention on the video frame. We also propose an additional geometric saliency cue that models the anticipation phenomenon observed in subjects watching video content from the wearable camera. The findings show that discarding irrelevant features gives better performance than the baseline method, which considers the whole set of features in the images.
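
The weighting idea in the abstract can be illustrated with a minimal sketch. It is not the chapter's implementation: it assumes hard assignment of each local descriptor to its nearest visual word, one saliency value in [0, 1] sampled at each feature location, and L1 normalisation of the signature. The names `nearest_word` and `bovw_signature`, and the toy data, are hypothetical.

```python
from math import dist


def nearest_word(descriptor, vocabulary):
    """Return the index of the visual word closest to the descriptor."""
    return min(range(len(vocabulary)), key=lambda k: dist(descriptor, vocabulary[k]))


def bovw_signature(descriptors, vocabulary, weights=None):
    """Build an L1-normalised BoVW histogram.

    weights: one saliency value per descriptor (assumed to lie in [0, 1]).
    None reproduces an unweighted baseline where every feature counts as 1.
    """
    if weights is None:
        weights = [1.0] * len(descriptors)
    histogram = [0.0] * len(vocabulary)
    for descriptor, weight in zip(descriptors, weights):
        # Each feature votes for its word with its saliency instead of 1.
        histogram[nearest_word(descriptor, vocabulary)] += weight
    total = sum(histogram)
    return [h / total for h in histogram] if total > 0 else histogram


# Toy data, made up for illustration: 2-D descriptors and a 2-word vocabulary.
vocabulary = [(0.0, 0.0), (1.0, 1.0)]
descriptors = [(0.1, 0.0), (0.9, 1.0), (1.0, 0.8)]
saliency = [0.9, 0.1, 0.0]  # only the first feature lies in a salient region

baseline = bovw_signature(descriptors, vocabulary)
weighted = bovw_signature(descriptors, vocabulary, saliency)
```

In this toy case the baseline signature is dominated by the second word, because two of the three features fall on it, while the weighted signature is dominated by the first word, whose single feature lies in the salient region. Features with zero saliency are effectively discarded.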
Document type:
Book chapter
Evaggelos Spyrou, Dimitris Iakovidis, Phivos Mylonas. Semantic Multimedia Analysis and Processing, CRC Press, p., 2014, Digital Imaging and Computer Vision

https://hal.archives-ouvertes.fr/hal-00993264
Contributor: Aurélie Bugeau
Submitted on: Tuesday, 20 May 2014 - 00:02:59
Last modified on: Tuesday, 20 May 2014 - 10:03:18
Document(s) archived on: Monday, 10 April 2017 - 23:58:20

File

CRCBook_Saliency_boujut_bugeau...
Files produced by the author(s)

Identifiers

  • HAL Id: hal-00993264, version 1

Citation

Hugo Boujut, Aurélie Bugeau, Jenny Benois-Pineau. Visual search for objects in a complex visual context: what we wish to see. Evaggelos Spyrou, Dimitris Iakovidis, Phivos Mylonas. Semantic Multimedia Analysis and Processing, CRC Press, p., 2014, Digital Imaging and Computer Vision. <hal-00993264>
