A study on multimodal video hyperlinking with visual aggregation - HAL open archive
Conference paper, Year: 2018

A study on multimodal video hyperlinking with visual aggregation

Abstract

Video hyperlinking offers a way to explore a video collection through links that connect segments with related content. Hyperlinking systems thus seek to create such links automatically by connecting given anchor segments to relevant targets within the collection. In this paper, we further investigate multimodal representations of video segments in a hyperlinking system based on bidirectional deep neural networks, which achieved state-of-the-art results in the TRECVid 2016 evaluation. We conduct a systematic study of different input representations, focusing on how the representations of multiple keyframes are aggregated. This includes, in particular, the use of memory vectors as a novel aggregation technique, which provides a significant improvement over other aggregation methods on the final hyperlinking task. Additionally, we investigate the use of metadata, which leads to increased performance and lower computational requirements for the system.
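The abstract does not say how keyframe representations are aggregated, so the following is only a minimal sketch of what aggregating several keyframe descriptors into one segment-level vector can look like. It assumes each keyframe is already described by a fixed-size feature vector, and it uses the generic sum and pseudo-inverse constructions of memory vectors from the similarity-search literature, which may differ from the formulation used in the paper. The names `l2_normalize` and `aggregate_keyframes` are hypothetical.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit L2 norm along the given axis."""
    norm = np.linalg.norm(x, axis=axis, keepdims=True)
    return x / np.maximum(norm, eps)


def aggregate_keyframes(features, method="mean"):
    """Aggregate keyframe descriptors into one segment-level vector.

    features: array of shape (n_keyframes, dim), one row per keyframe.
    method:   "mean", "max", "memory_sum" or "memory_pinv" (illustrative names).
    Rows are L2-normalized before aggregation (an assumption of this sketch).
    """
    feats = l2_normalize(np.asarray(features, dtype=np.float64))
    if method == "mean":
        return feats.mean(axis=0)
    if method == "max":
        return feats.max(axis=0)
    if method == "memory_sum":
        # Sum memory vector: the plain sum of the normalized descriptors.
        return feats.sum(axis=0)
    if method == "memory_pinv":
        # Pseudo-inverse memory vector: m = pinv(F) @ 1. When the rows of F
        # are linearly independent, every keyframe has dot product 1 with m.
        return np.linalg.pinv(feats) @ np.ones(feats.shape[0])
    raise ValueError(f"unknown aggregation method: {method}")


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    keyframes = rng.normal(size=(5, 32))  # 5 synthetic keyframe descriptors
    for name in ("mean", "max", "memory_sum", "memory_pinv"):
        print(name, aggregate_keyframes(keyframes, name).shape)
```

In this sketch the pseudo-inverse variant weights keyframes so that each one matches the aggregate equally, whereas the mean and sum variants let near-duplicate keyframes dominate the result.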
Main file
study-multimodal-video.pdf (173.48 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01862199 , version 1 (27-08-2018)

Cite

Mateusz Budnik, Mikail Demirdelen, Guillaume Gravier. A study on multimodal video hyperlinking with visual aggregation. IEEE International Conference on Multimedia and Expo, Jul 2018, San Diego, United States. pp.1-6, ⟨10.1109/ICME.2018.8486549⟩. ⟨hal-01862199⟩
241 views
223 downloads
