A study on multimodal video hyperlinking with visual aggregation

Mateusz Budnik; Mikail Demirdelen; Guillaume Gravier

doi:10.1109/ICME.2018.8486549

Communication Dans Un Congrès Année : 2018

A study on multimodal video hyperlinking with visual aggregation

(1) , (1) , (1)

Mateusz Budnik

Fonction : Auteur

Creating and exploiting explicit links between multimedia fragments

Mikail Demirdelen

Fonction : Auteur
PersonId : 995040

Creating and exploiting explicit links between multimedia fragments

Guillaume Gravier

Fonction : Auteur
PersonId : 1046
IdHAL : guig
ORCID : 0000-0002-2266-5682
IdRef : 110355415

Creating and exploiting explicit links between multimedia fragments

Résumé

Video hyperlinking offers a way to explore a video collection, making use of links that connect segments having related content. Hyperlinking systems thus seek to automatically create links by connecting given anchor segments to relevant targets within the collection. In this paper, we further investigate multimodal representations of video segments in a hyper-linking system based on bidirectional deep neural networks, which achieved state-of-the-art results in the TRECVid 2016 evaluation. A systematic study of different input representations is done with a focus on the aggregation of the representation of multiple keyframes. This includes, in particular, the use of memory vectors as a novel aggregation technique, which provides a significant improvement over other aggre-gation methods on the final hyperlinking task. Additionally, the use of metadata is investigated leading to increased performance and lower computational requirements for the system.

Mots clés

multimodal embedding Index Terms— Video hyperlinking Multimedia Computer Vision Natural Language Processing Deep Learning

Domaines

Multimédia [cs.MM] Son [cs.SD] Traitement du signal et de l'image [eess.SP] Traitement du texte et du document Vision par ordinateur et reconnaissance de formes [cs.CV] Apprentissage [cs.LG]

Fichier principal

study-multimodal-video.pdf (173.48 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Guillaume Gravier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01862199

Soumis le : lundi 27 août 2018-10:39:50

Dernière modification le : vendredi 24 mars 2023-14:53:08

Archivage à long terme le : mercredi 28 novembre 2018-13:04:27

Dates et versions

hal-01862199 , version 1 (27-08-2018)

Identifiants

HAL Id : hal-01862199 , version 1
DOI : 10.1109/ICME.2018.8486549

Citer

Mateusz Budnik, Mikail Demirdelen, Guillaume Gravier. A study on multimodal video hyperlinking with visual aggregation. IEEE International Conference on Multimedia and Expo, Jul 2018, San Diego, United States. pp.1-6, ⟨10.1109/ICME.2018.8486549⟩. ⟨hal-01862199⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA CENTRALESUPELEC INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

241 Consultations

223 Téléchargements

A study on multimodal video hyperlinking with visual aggregation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager