A multi-cue spatio-temporal framework for automatic frontal face clustering in video sequences

Siméon Schwab; Thierry Chateau; Christophe Blanc; Laurent Trassoudaine

doi:10.1186/1687-5281-2013-10

Article Dans Une Revue EURASIP Journal on Image and Video Processing Année : 2013

A multi-cue spatio-temporal framework for automatic frontal face clustering in video sequences

(1) , (2) , (3) , (2)

1
2
3

Siméon Schwab

Fonction : Auteur

Laboratoire des Adaptations Métaboliques à l'Exercice en Conditions Physiologiques et Pathologiques

Thierry Chateau

Fonction : Auteur
PersonId : 8056
IdHAL : thierry-chateau
ORCID : 0000-0003-4854-5686
IdRef : 154402176

Laboratoire des sciences et matériaux pour l'électronique et d'automatique

Christophe Blanc

Fonction : Auteur
PersonId : 17245
IdHAL : christophe-blanc-ip

Institut Pascal

Laurent Trassoudaine

Fonction : Auteur
PersonId : 16911
IdHAL : laurent-trassoudaine
ORCID : 0000-0002-3486-3918
IdRef : 13160211X

Laboratoire des sciences et matériaux pour l'électronique et d'automatique

Résumé

Clustering of specific object detections is a challenging problem for video summarization. In this article, we present a method to form tracks by grouping face detections of a video sequence. Our clustering method is based on a probabilistic maximum a posteriori data association framework, and we apply it to face detection in a visual surveillance context. Optimal solution is found with a procedure using network-flow algorithms described in previous pedestrian tracking-by-detection works. To address difficult cases of small detections in scenes with multiple moving people, given that face detections are located in a video sequence, we use dissimilarities involving appearance and spatio-temporal information. The main contribution is the use of an optical flow or local front-back tracking to handle complex situations appearing in real sequences. The resulting algorithm is then able to deal with situations where people are crossing one another and face detections are scattered due to head rotation. The clustering step of our framework is compared to generic clustering methods (hierarchical clustering and affinity propagation) on several real challenging sequences, as evaluations indicate that this is more adapted to video-based detection clustering. We propose to use a new evaluation criteria, derived from purity and inverse purity of a clustering estimation, to assess performances of such methods. Results also show that optical flow and a skin color prior added to face detections improve the clustering quality.

Mots clés

Clustering Face detection Multiple visual tracking Optical flow Maximum a posteriori

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

1687-5281-2013-10.pdf (4.5 Mo)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Laurent Trassoudaine : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01877613

Soumis le : mercredi 19 décembre 2018-12:27:08

Dernière modification le : samedi 22 avril 2023-04:24:24

Archivage à long terme le : mercredi 20 mars 2019-18:35:08

Dates et versions

hal-01877613 , version 1 (19-12-2018)

Licence

Paternité - Pas d'utilisation commerciale

Identifiants

HAL Id : hal-01877613 , version 1
DOI : 10.1186/1687-5281-2013-10

Citer

Siméon Schwab, Thierry Chateau, Christophe Blanc, Laurent Trassoudaine. A multi-cue spatio-temporal framework for automatic frontal face clustering in video sequences. EURASIP Journal on Image and Video Processing, 2013, 2013 (1), pp.10. ⟨10.1186/1687-5281-2013-10⟩. ⟨hal-01877613⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

PRES_CLERMONT CNRS INSTITUT_PASCAL AME2P

52 Consultations

32 Téléchargements

A multi-cue spatio-temporal framework for automatic frontal face clustering in video sequences

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager