SSIG and IRISA at Multimodal Person Discovery

Abstract : This paper describes our approach and results in the multi-modal person discovery in broadcast TV task at MediaEval 2015. We investigate two distinct aspects of multimodal person discovery. One refers to face clusters, which are considered to propagate names associated to faces in one shot to other faces that probably belong to the same person. The face clustering approach consists in calculating face similarities using partial least squares (PLS) and a simple hierarchical approach. The other aspect refers to tag propagation in a graph-based approach where nodes are speaking faces and edges link similar faces/speakers. The advantage of the graph-based tag propagation is to not rely on face/speaker clustering, which we believe can be errorprone.
Liste complète des métadonnées

Littérature citée [9 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01196171
Contributeur : Guillaume Gravier <>
Soumis le : mercredi 9 septembre 2015 - 11:37:04
Dernière modification le : jeudi 15 novembre 2018 - 11:58:51
Document(s) archivé(s) le : lundi 28 décembre 2015 - 23:09:54

Fichier

mediaeval.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : hal-01196171, version 1

Citation

Cassio Dos Santos Jr., Guillaume Gravier, William Robson Schwartz. SSIG and IRISA at Multimodal Person Discovery. Working Notes Proceedings of the MediaEval Workshop, 2015, Wurzen, Germany. 〈hal-01196171〉

Partager

Métriques

Consultations de la notice

1088

Téléchargements de fichiers

155