SSIG and IRISA at Multimodal Person Discovery

Abstract : This paper describes our approach and results for the multimodal person discovery in broadcast TV task at MediaEval 2015. We investigate two distinct aspects of multimodal person discovery. The first concerns face clusters, which are used to propagate names associated with faces in one shot to other faces that probably belong to the same person. The face clustering approach computes face similarities using partial least squares (PLS) and a simple hierarchical approach. The second concerns tag propagation in a graph-based approach where nodes are speaking faces and edges link similar faces/speakers. The advantage of graph-based tag propagation is that it does not rely on face/speaker clustering, which we believe can be error-prone.
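
The graph-based tag propagation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the propagation rule, so everything below is an assumption made for illustration: the function name `propagate_tags`, the `min_score` threshold, the similarity weights in [0, 1], and the rule itself (an unnamed node takes the name of the neighbor that maximizes neighbor confidence times edge similarity). It is not the authors' method.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: node id -> list of (neighbor id, similarity in [0, 1]).
Graph = Dict[str, List[Tuple[str, float]]]


def propagate_tags(
    graph: Graph, seeds: Dict[str, str], min_score: float = 0.5
) -> Dict[str, Tuple[str, float]]:
    """Spread names from initially named nodes (seeds) along similarity edges.

    Assumed rule (not taken from the paper): a node's score for a name is the
    neighbor's confidence multiplied by the edge similarity, and a name is
    accepted only if that score reaches min_score.
    """
    # Seeds keep their name with full confidence.
    labels: Dict[str, Tuple[str, float]] = {n: (name, 1.0) for n, name in seeds.items()}
    changed = True
    while changed:
        changed = False
        for node in sorted(graph):
            if node in seeds:
                continue  # never overwrite an initial name
            best = labels.get(node)
            for neighbor, weight in graph[node]:
                if neighbor not in labels:
                    continue
                name, conf = labels[neighbor]
                score = conf * weight
                # Only strictly better scores replace the current label,
                # so the loop terminates when weights are in [0, 1].
                if score >= min_score and (best is None or score > best[1]):
                    best = (name, score)
            if best != labels.get(node):
                labels[node] = best
                changed = True
    return labels


# Toy graph of four speaking faces; only "a" has an initial name.
graph: Graph = {
    "a": [("b", 0.9)],
    "b": [("a", 0.9), ("c", 0.8)],
    "c": [("b", 0.8), ("d", 0.3)],
    "d": [("c", 0.3)],
}
labels = propagate_tags(graph, seeds={"a": "Alice"})
for node in sorted(labels):
    print(node, labels[node])
```

In this toy graph the name reaches "b" and "c" through strong edges, while "d" stays unnamed because its only edge is too weak. No clustering step is involved, which is the property the abstract highlights.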

https://hal.archives-ouvertes.fr/hal-01196171
Contributor : Guillaume Gravier
Submitted on : Wednesday, September 9, 2015 - 11:37:04 AM
Last modification on : Thursday, November 15, 2018 - 11:58:51 AM
Document(s) archived on : Monday, December 28, 2015 - 11:09:54 PM

File

mediaeval.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-01196171, version 1

Citation

Cassio Dos Santos Jr., Guillaume Gravier, William Robson Schwartz. SSIG and IRISA at Multimodal Person Discovery. Working Notes Proceedings of the MediaEval Workshop, 2015, Wurzen, Germany. ⟨hal-01196171⟩
