Towards large scale multimedia indexing: A case study on person discovery in broadcast news

Abstract : The rapid growth of multimedia databases and the human interest in their peers make indices representing the location and identity of people in audiovisual documents essential for searching archives. Person discovery in the absence of prior identity knowledge requires accurate association of audiovisual cues and detected names. To this end, we present 3 different strategies to approach this problem: clustering-based naming, verification-based naming, and graph-based naming. Each of these strategies utilizes different recent advances in unsupervised face / speech representation, verification, and optimization. To have a better understanding of the approaches, this paper also provides a quantitative and qualitative comparative study of these approaches using the associated corpus of the Person Discovery challenge at MediaEval 2016. From the results of our experiments, we can observe the pros and cons of each approach, thus paving the way for future promising research directions.
Liste complète des métadonnées

Littérature citée [33 références]  Voir  Masquer  Télécharger
Contributeur : Gabriel Sargent <>
Soumis le : vendredi 30 juin 2017 - 14:28:16
Dernière modification le : mercredi 21 février 2018 - 01:27:23
Document(s) archivé(s) le : lundi 22 janvier 2018 - 20:15:49


Fichiers produits par l'(les) auteur(s)



Nam Le, Hervé Bredin, Gabriel Sargent, Miquel India, Paula Lopez-Otero, et al.. Towards large scale multimedia indexing: A case study on person discovery in broadcast news. Content-Based Multimedia Indexing CBMI, Jun 2017, Firenze, Italy. 2017, Proceedings of the 15th international workshop on Content-Based Multimedia Indexing. 〈10.1145/3095713.3095732〉. 〈hal-01551690〉



Consultations de la notice


Téléchargements de fichiers