Deep learning for predicting image memorability
Conference paper, Year: 2018

Abstract

The memorability of media content such as images and videos has recently become an important research subject in computer vision. This paper presents our computational model for predicting image memorability, which is based on a deep learning architecture designed for a classification task. We exploit both convolutional neural network (CNN)-based visual features and semantic features related to image captioning for the task. We train and test our model on LaMem, the large-scale memorability benchmark dataset. Experimental results show that the proposed computational model obtains better prediction performance than the state of the art, and even outperforms human consistency. We further investigate the genericity of our model on other memorability datasets. Finally, by validating the model on interestingness datasets, we reconfirm the lack of correlation between the memorability and interestingness of images.

Index Terms: Image memorability, computational model, deep learning, interestingness, image captioning
Main file
main.pdf (8.27 MB)
IEEEbib.bst (17.48 KB)
Picture1.jpg (465.95 KB)
Picture1_without_normalization.jpg (469.88 KB)
bestmodel.jpg (75.3 KB)
dataset_Bainbridge.jpg (33.28 KB)
dataset_Borkin.jpg (33.08 KB)
dataset_Dubey.jpg (35.79 KB)
dataset_Filgrim.jpg (36.96 KB)
dataset_IAPS.jpg (36.9 KB)
dataset_Isola.jpg (51.99 KB)
dataset_LaMem.jpg (38.69 KB)
workflow.jpg (59.55 KB)
~$pictures.pptx (165 B)
Origin: Files produced by the author(s)

Dates and versions

hal-01629297, version 1 (06-11-2017)

Identifiers

  • HAL Id: hal-01629297, version 1

Cite

Hammad Squalli-Houssaini, Ngoc Q. K. Duong, Marquant Gwenaëlle, Claire-Hélène Demarty. Deep learning for predicting image memorability. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2018, Calgary, Canada. ⟨hal-01629297⟩
