Unsupervised mining of multiple audiovisually consistent clusters for video structure analysis - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Unsupervised mining of multiple audiovisually consistent clusters for video structure analysis

Résumé

We address the problem of detecting multiple audiovisual events related to the edit structure of a video by incorporating an unsupervised cluster analysis technique into a cluster selection method designed to measure coherence between audio and visual segments. First, mutual information measure is used to select audio-visually consistent clusters from two dendrograms representing hierarchical clustering results respectively for the audio and visual modalities. A cluster analysis technique is then applied to define events from the audio-visual (AV) clusters with segments co-occurring frequently. Candidate events are then characterized by groups of AV clusters from which models are built by automatically selecting positive and negative examples. Experiments on the standard Canal9 data set demonstrates that our method is capable of discovering multiple audiovisual events in a totally unsupervised manner.
Fichier principal
Vignette du fichier
ICME2012.pdf (202.32 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00718985 , version 1 (18-07-2012)

Identifiants

  • HAL Id : hal-00718985 , version 1

Citer

Anh-Phuong Ta, Guillaume Gravier. Unsupervised mining of multiple audiovisually consistent clusters for video structure analysis. ICME - International Conference on Multimedia and Exhibition, 2012, Australia. ⟨hal-00718985⟩
259 Consultations
251 Téléchargements

Partager

Gmail Facebook X LinkedIn More