Unsupervised mining of multiple audiovisually consistent clusters for video structure analysis - Archive ouverte HAL Access content directly
Conference Papers Year : 2012

Unsupervised mining of multiple audiovisually consistent clusters for video structure analysis

Abstract

We address the problem of detecting multiple audiovisual events related to the edit structure of a video by incorporating an unsupervised cluster analysis technique into a cluster selection method designed to measure coherence between audio and visual segments. First, mutual information measure is used to select audio-visually consistent clusters from two dendrograms representing hierarchical clustering results respectively for the audio and visual modalities. A cluster analysis technique is then applied to define events from the audio-visual (AV) clusters with segments co-occurring frequently. Candidate events are then characterized by groups of AV clusters from which models are built by automatically selecting positive and negative examples. Experiments on the standard Canal9 data set demonstrates that our method is capable of discovering multiple audiovisual events in a totally unsupervised manner.
Fichier principal
Vignette du fichier
ICME2012.pdf (202.32 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-00718985 , version 1 (18-07-2012)

Identifiers

  • HAL Id : hal-00718985 , version 1

Cite

Anh-Phuong Ta, Guillaume Gravier. Unsupervised mining of multiple audiovisually consistent clusters for video structure analysis. ICME - International Conference on Multimedia and Exhibition, 2012, Australia. ⟨hal-00718985⟩
259 View
254 Download

Share

Gmail Facebook X LinkedIn More