Nonnegative matrix factorization for unsupervised audiovisual document structuring - Archive ouverte HAL Accéder directement au contenu
Rapport Année : 2011

Nonnegative matrix factorization for unsupervised audiovisual document structuring

Résumé

This paper introduces a new paradigm for unsupervised audiovisual document structuring. In this paradigm, a novel Nonnegative Matrix Factorization (NMF) algorithm is applied on histograms of counts (relating to a bag of features representation of the content) to jointly discover latent structuring patterns and their activations in time. Our NMF variant employs the Kullback-Leibler divergence as a cost function and imposes a temporal smoothness constraint to the activations. It is solved for using a majorization-minimization technique. The approach proposed is meant to be generic and is particularly well suited to applications where the structuring patterns may overlap in time. As such, it is evaluated on a person-oriented video structuring task, using a challenging database of political debate videos. Our results outperform reference results obtained by a method using Hidden Markov Models. Further, we show the potential that our general approach has for audio speaker diarization.
Fichier principal
Vignette du fichier
nmf-vstruct-hal.pdf (1.41 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00605886 , version 1 (04-07-2011)

Identifiants

  • HAL Id : hal-00605886 , version 1

Citer

Slim Essid, Cédric Févotte. Nonnegative matrix factorization for unsupervised audiovisual document structuring. 2011. ⟨hal-00605886⟩
228 Consultations
282 Téléchargements

Partager

Gmail Facebook X LinkedIn More