Exploiting Multimodality in Video Hyperlinking to Improve Target Diversity

Abstract : Video hyperlinking is the process of creating links within a collection of videos to help navigation and information seeking. Starting from a given set of video segments, called anchors, a set of related segments, called targets, must be provided. In past years, a number of content-based approaches have been proposed with good results obtained by searching for target segments that are very similar to the anchor in terms of content and information. Unfortunately, relevance has been obtained to the expense of diversity. In this paper, we study multimodal approaches and their ability to provide a set of diverse yet relevant targets. We compare two recently introduced cross-modal approaches, namely, deep auto-encoders and bimodal LDA, and experimentally show that both provide significantly more diverse targets than a state-of-the-art baseline. Bimodal autoencoders offer the best trade-off between relevance and diversity, with bimodal LDA exhibiting slightly more diverse targets at a lower precision.
Liste complète des métadonnées

Cited literature [26 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01498130
Contributor : Guillaume Gravier <>
Submitted on : Thursday, March 22, 2018 - 11:01:20 PM
Last modification on : Monday, December 17, 2018 - 9:06:01 AM
Document(s) archivé(s) le : Thursday, September 13, 2018 - 8:45:47 AM

File

diversity.pdf
Files produced by the author(s)

Identifiers

Citation

Rémi Bois, Vedran Vukotić, Anca-Roxana Simon, Ronan Sicre, Christian Raymond, et al.. Exploiting Multimodality in Video Hyperlinking to Improve Target Diversity. MMM2017 - International Conference on Multimedia Modeling, Jan 2017, Reykyavik, Iceland. ⟨10.1007/978-3-319-51814-5_16⟩. ⟨hal-01498130v2⟩

Share

Metrics

Record views

819

Files downloads

86