Multimodal Person Discovery in Broadcast TV: lessons learned from MediaEval 2015

Johann Poignant; Hervé Bredin; Claude Barras

doi:10.1007/s11042-017-4730-x

Journal article, Multimedia Tools and Applications, Year: 2017

Johann Poignant

Role: Author

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Hervé Bredin

Role: Author
PersonId: 15856
IdHAL: hbredin
ORCID: 0000-0002-3739-925X
IdRef: 121165779

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Claude Barras

Role: Author
PersonId: 17217
IdHAL: claude-barras
IdRef: 165065583

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Abstract

We describe the "Multimodal Person Discovery in Broadcast TV" task of the MediaEval 2015 benchmarking initiative. Participants were asked to return the names of people who can be both seen and heard in every shot of a collection of videos. The list of people was not known a priori, and their names had to be discovered in an unsupervised way from the media content, using text overlay or speech transcripts. The task was evaluated using information retrieval metrics, based on a posteriori collaborative annotation of the test corpus. The first edition of the task gathered 9 teams, which submitted 34 runs. This paper provides quantitative and qualitative comparisons of the participants' submissions. We also investigate why all systems failed for particular shots, paving the way for promising future research directions.

Keywords

benchmark; information retrieval; unsupervised person recognition; multimodal fusion; error analysis

Domains

Computer Science [cs]; Multimedia [cs.MM]

Main file

Poignant2017.pdf (2.12 MB)

Origin: Files produced by the author(s)

https://hal.science/hal-01690581

Submitted on: Tuesday, January 23, 2018, 11:07:57

Last modified on: Saturday, October 7, 2023, 21:36:20

Long-term archiving on: Thursday, May 24, 2018, 11:05:19

Dates and versions

hal-01690581, version 1 (23-01-2018)

Identifiers

HAL Id: hal-01690581, version 1
DOI: 10.1007/s11042-017-4730-x

Cite

Johann Poignant, Hervé Bredin, Claude Barras. Multimodal Person Discovery in Broadcast TV: lessons learned from MediaEval 2015. Multimedia Tools and Applications, 2017, 76 (21), pp. 22547-22567. ⟨10.1007/s11042-017-4730-x⟩. ⟨hal-01690581⟩

Collections

CNRS LIMSI UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE ANR LISN GS-ENGINEERING GS-COMPUTER-SCIENCE

78 views

97 downloads
