Unsupervised mining of audiovisually consistent segments in videos with application to structure analysis

Mathieu Ben; Guillaume Gravier

Conference paper, Year: 2011

Mathieu Ben

Role: Author

Multimedia content-based indexing

Guillaume Gravier

Role: Author
PersonId: 1046
IdHAL: guig
ORCID: 0000-0002-2266-5682
IdRef: 110355415

Multimedia content-based indexing

Abstract

This paper proposes a multimodal event mining technique that discovers repeating video segments exhibiting audio and visual consistency in a totally unsupervised manner. The mining strategy first runs independent audio and visual cluster analyses and retains the segments that are consistent in both modalities, which are therefore likely to correspond to a single underlying event. A subsequent modeling stage using discriminative models enables accurate detection of that event throughout the video. Event mining is applied to unsupervised video structure analysis, using simple heuristics on the occurrence patterns of the discovered events to select those relevant to the video structure. Results on TV programs ranging from news to talk shows and games show that structurally relevant events are discovered with precision ranging from 87% to 98% and recall ranging from 59% to 94%.
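As a rough illustration of the first mining step described in the abstract, the sketch below pairs audio clusters with visual clusters whose member segments largely coincide. It is not the authors' method: the abstract does not state the consistency criterion, so the Jaccard overlap, the `min_overlap` and `min_count` thresholds, and the function name `consistent_cluster_pairs` are all assumptions made for illustration. The cluster labels are taken as given, and the discriminative modeling stage is not covered.

```python
from collections import defaultdict


def consistent_cluster_pairs(audio_labels, visual_labels,
                             min_overlap=0.8, min_count=2):
    """Pair audio and visual clusters that cover nearly the same segments.

    audio_labels[i] and visual_labels[i] are the cluster labels assigned
    to segment i by two independent clusterings. The Jaccard criterion
    and both thresholds are illustrative assumptions, not taken from
    the paper.
    """
    if len(audio_labels) != len(visual_labels):
        raise ValueError("both label sequences must cover the same segments")

    # Collect the set of segment indices belonging to each cluster.
    audio_members = defaultdict(set)
    visual_members = defaultdict(set)
    for idx, (a, v) in enumerate(zip(audio_labels, visual_labels)):
        audio_members[a].add(idx)
        visual_members[v].add(idx)

    pairs = []
    for a, a_set in audio_members.items():
        for v, v_set in visual_members.items():
            common = a_set & v_set
            # A repeating event needs at least min_count shared segments.
            if len(common) < min_count:
                continue
            # Jaccard overlap: shared segments over all segments in either cluster.
            overlap = len(common) / len(a_set | v_set)
            if overlap >= min_overlap:
                pairs.append((a, v, sorted(common)))
    return pairs


# Hypothetical labels for six segments of one video.
audio = [0, 0, 1, 0, 2, 1]
visual = ["x", "x", "y", "x", "z", "z"]
print(consistent_cluster_pairs(audio, visual))  # [(0, 'x', [0, 1, 3])]
```

In this toy input, audio cluster 0 and visual cluster "x" contain exactly the same three segments, so they form one audiovisually consistent candidate event; the remaining clusters share at most one segment and are discarded.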

Domains

Multimedia [cs.MM]

Main file

paper_434.pdf (330.28 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-00646603

Submitted on: Wednesday, 30 November 2011, 13:26:43

Last modified on: Friday, 24 March 2023, 14:52:55

Long-term archiving on: Thursday, 1 March 2012, 02:30:27

Dates and versions

hal-00646603 , version 1 (30-11-2011)

Identifiers

HAL Id : hal-00646603 , version 1

Cite

Mathieu Ben, Guillaume Gravier. Unsupervised mining of audiovisually consistent segments in videos with application to structure analysis. IEEE Intl. Conf. on Multimedia and Exhibition, 2011, Spain. ⟨hal-00646603⟩

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D6 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

146 views

157 downloads
