Skip to Main content Skip to Navigation
Conference papers

Tag Propagation Approaches within Speaking Face Graphs for Multimodal Person Discovery

Abstract : The indexing of broadcast TV archives is a current problem in multimedia research. As the size of these databases grows continuously, meaningful features are needed to describe and connect their elements efficiently, such as the identification of speaking faces. In this context, this paper focuses on two approaches for unsupervised person discovery. Initial tagging of speaking faces is provided by an OCR-based method, and these tags propagate through a graph model based on audiovisual relations between speaking faces. Two propagation methods are proposed, one based on random walks and the other based on a hierarchical approach. To better evaluate their performances, these methods were compared with two graph clustering baselines. We also study the impact of different modality fusions on the graph-based tag propagation scenario. From a quantitative analysis, we observed that the graph propagation techniques always outperform the baselines. Among all compared strategies, the methods based on hierarchical propagation with late fusion and random walk with score-fusion obtained the highest MAP values. Finally, even though these two methods produce highly equivalent results according to Kappa coefficient, the random walk method performs better according to a paired t-test, and the computing time for the hierarchical propagation is more than 4 times lower than the one for the random walk propagation.
Complete list of metadatas

Cited literature [17 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01551648
Contributor : Gabriel Sargent <>
Submitted on : Friday, June 30, 2017 - 2:04:54 PM
Last modification on : Tuesday, February 25, 2020 - 8:08:12 AM
Document(s) archivé(s) le : Monday, January 22, 2018 - 8:38:40 PM

File

2017-conf-cbmi-tag-propagation...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01551648, version 1

Citation

Gabriel Barbosa Da Fonseca, Gabriel Sargent, Izabela Lyon Freire, Ronan Sicre, Zenilton Patrocinio, et al.. Tag Propagation Approaches within Speaking Face Graphs for Multimodal Person Discovery. International workshop on Content-Based Multimedia Indexing (CBMI), Jun 2017, Firenze, Italy. ⟨hal-01551648⟩

Share

Metrics

Record views

981

Files downloads

334