Conference papers

Tag Propagation Approaches within Speaking Face Graphs for Multimodal Person Discovery

Abstract: The indexing of broadcast TV archives is a current problem in multimedia research. As the size of these databases grows continuously, meaningful features, such as the identification of speaking faces, are needed to describe and connect their elements efficiently. In this context, this paper focuses on two approaches for unsupervised person discovery. Initial tagging of speaking faces is provided by an OCR-based method, and these tags propagate through a graph model based on audiovisual relations between speaking faces. Two propagation methods are proposed, one based on random walks and the other based on a hierarchical approach. To better evaluate their performance, these methods were compared with two graph clustering baselines. We also study the impact of different modality fusions on the graph-based tag propagation scenario. From a quantitative analysis, we observed that the graph propagation techniques always outperform the baselines. Among all compared strategies, the methods based on hierarchical propagation with late fusion and random walk with score fusion obtained the highest MAP values. Finally, even though these two methods produce highly equivalent results according to the Kappa coefficient, the random walk method performs better according to a paired t-test, while the computing time for the hierarchical propagation is more than 4 times lower than that of the random walk propagation.
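To make the idea of propagating tags through a graph concrete, here is a minimal sketch of a generic random walk with restart. It is not the authors' implementation: the function name `propagate_tags`, the damping factor `alpha`, the iteration count, and the toy weight matrix are all assumptions chosen for illustration. Nodes stand for speaking faces, edge weights for hypothetical audiovisual similarities, and the seed tags for names that an OCR step might have supplied.

```python
def propagate_tags(weights, seeds, alpha=0.85, iters=100):
    """Generic random walk with restart (illustrative, not the paper's code).

    weights: n x n list of lists of non-negative edge weights
             (hypothetical similarities between speaking faces).
    seeds:   dict mapping node index -> initial tag (e.g. an OCR name).
    Returns a list holding one tag per node, or None if no tag reached it.
    """
    n = len(weights)
    tags = sorted(set(seeds.values()))

    # Row-normalise the weights into transition probabilities.
    trans = []
    for row in weights:
        total = sum(row)
        trans.append([w / total if total > 0 else 0.0 for w in row])

    # One restart vector per tag: 1 on the nodes initially carrying it.
    restart = {t: [1.0 if seeds.get(i) == t else 0.0 for i in range(n)]
               for t in tags}
    scores = {t: list(restart[t]) for t in tags}

    # Iterate score = alpha * P * score + (1 - alpha) * restart.
    for _ in range(iters):
        for t in tags:
            old = scores[t]
            scores[t] = [
                alpha * sum(trans[i][j] * old[j] for j in range(n))
                + (1 - alpha) * restart[t][i]
                for i in range(n)
            ]

    # Each node takes the tag with the highest score, if any is positive.
    result = []
    for i in range(n):
        best = max(tags, key=lambda t: scores[t][i]) if tags else None
        if best is not None and scores[best][i] > 0:
            result.append(best)
        else:
            result.append(None)
    return result


# Toy graph: nodes 0-2 are tightly linked, nodes 3-4 are tightly linked,
# and a weak edge joins node 2 to node 3.
W = [
    [0, 1, 1, 0, 0],
    [1, 0, 1, 0, 0],
    [1, 1, 0, 0.1, 0],
    [0, 0, 0.1, 0, 1],
    [0, 0, 0, 1, 0],
]
labels = propagate_tags(W, {0: "alice", 4: "bob"})
print(labels)
```

In this toy graph the tag seeded on node 0 spreads over the first cluster and the tag seeded on node 4 over the second, because the weak edge between nodes 2 and 3 carries little probability mass. A hierarchical propagation, as also mentioned in the abstract, would follow a different scheme that this sketch does not cover.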


Contributor : Gabriel Sargent
Submitted on : Friday, June 30, 2017 - 2:04:54 PM
Last modification on : Friday, April 8, 2022 - 4:08:03 PM
Long-term archiving on : Monday, January 22, 2018 - 8:38:40 PM


Files produced by the author(s)


  • HAL Id : hal-01551648, version 1


Gabriel Barbosa Da Fonseca, Gabriel Sargent, Izabela Lyon Freire, Ronan Sicre, Zenilton Patrocinio, et al. Tag Propagation Approaches within Speaking Face Graphs for Multimodal Person Discovery. International workshop on Content-Based Multimedia Indexing (CBMI), Jun 2017, Firenze, Italy. ⟨hal-01551648⟩


