Skip to Main content Skip to Navigation
Conference papers

Exploiting Multimodality in Video Hyperlinking to Improve Target Diversity

Abstract : Video hyperlinking is the process of creating links within a collection of videos to help navigation and information seeking. Starting from a given set of video segments, called anchors, a set of related segments, called targets, must be provided. In past years, a number of content-based approaches have been proposed with good results obtained by searching for target segments that are very similar to the anchor in terms of content and information. Unfortunately, relevance has been obtained to the expense of diversity. In this paper, we study multimodal approaches and their ability to provide a set of diverse yet relevant targets. We compare two recently introduced cross-modal approaches, namely, deep auto-encoders and bimodal LDA, and experimentally show that both provide significantly more diverse targets than a state-of-the-art baseline. Bimodal autoencoders offer the best trade-off between relevance and diversity, with bimodal LDA exhibiting slightly more diverse targets at a lower precision.
Complete list of metadata

Cited literature [26 references]  Display  Hide  Download
Contributor : Guillaume Gravier Connect in order to contact the contributor
Submitted on : Thursday, March 22, 2018 - 11:01:20 PM
Last modification on : Friday, April 8, 2022 - 4:08:03 PM
Long-term archiving on: : Thursday, September 13, 2018 - 8:45:47 AM


Files produced by the author(s)



Rémi Bois, Vedran Vukotić, Anca-Roxana Simon, Ronan Sicre, Christian Raymond, et al.. Exploiting Multimodality in Video Hyperlinking to Improve Target Diversity. MMM2017 - International Conference on Multimedia Modeling, Jan 2017, Reykyavik, Iceland. ⟨10.1007/978-3-319-51814-5_16⟩. ⟨hal-01498130v2⟩



Record views


Files downloads