Evaluation of video activity localizations integrating quality and quantity measurements

Abstract: The performance of computer vision algorithms is classically evaluated by reporting classification error or accuracy when the problem at hand is the classification of an object in an image, the recognition of an activity in a video, or the categorization and labeling of an image or video. If, in addition, the detection of an item in an image or video and/or its localization is required, frequently used metrics are Recall and Precision, as well as ROC curves. These metrics give quantitative performance values that are easy to understand and interpret, even for non-experts. However, an inherent problem is the dependency of quantitative performance measures on the quality constraints that we need to impose on the detection algorithm. In particular, an important quality parameter of these measures is the spatial or spatio-temporal overlap between a ground-truth item and a detected item, and this needs to be taken into account when interpreting the results. We propose a new performance metric that addresses and unifies the qualitative and quantitative aspects of the performance measures. The performance of a detection and recognition algorithm is illustrated intuitively by performance graphs that present quantitative performance values, such as Recall, Precision and F-Score, as functions of the quality constraints of the detection. In order to compare the performance of different computer vision algorithms, a single representative performance measure is computed from the graphs by integrating out all quality parameters. The evaluation method can be applied to different types of activity detection and recognition algorithms. The performance metric has been tested on several activity recognition algorithms participating in the ICPR 2012 HARL competition.
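The idea of plotting a quantitative score against a quality constraint and then integrating the constraint out can be illustrated with a minimal sketch. It is not the metric defined in the paper: it assumes a single quality parameter (a temporal overlap threshold), one-dimensional activity intervals, greedy one-to-one matching, and a plain average over evenly spaced thresholds as the integration step. All names (`overlap_ratio`, `f_score_at`, `integrated_f_score`) are hypothetical.

```python
from typing import List, Tuple

# Hypothetical simplification: an activity is a (start, end) temporal interval.
Interval = Tuple[float, float]


def overlap_ratio(a: Interval, b: Interval) -> float:
    """Intersection over union of two temporal intervals."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def f_score_at(gt: List[Interval], det: List[Interval], threshold: float) -> float:
    """F-Score when a match requires an overlap ratio of at least `threshold`.

    Assumption: greedy one-to-one matching, each ground-truth item takes the
    remaining detection with the highest overlap.
    """
    unmatched = list(det)
    matches = 0
    for g in gt:
        best = max(unmatched, key=lambda d: overlap_ratio(g, d), default=None)
        if best is not None and overlap_ratio(g, best) >= threshold:
            unmatched.remove(best)
            matches += 1
    recall = matches / len(gt) if gt else 0.0
    precision = matches / len(det) if det else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def integrated_f_score(gt: List[Interval], det: List[Interval], steps: int = 10) -> float:
    """Average the F-Score over evenly spaced overlap thresholds in (0, 1].

    This approximates integrating the quality parameter out of the
    F-Score-versus-threshold performance graph.
    """
    thresholds = [(i + 1) / steps for i in range(steps)]
    return sum(f_score_at(gt, det, t) for t in thresholds) / steps


if __name__ == "__main__":
    ground_truth = [(0, 10), (20, 30)]
    detections = [(0, 10), (22, 32)]
    print(integrated_f_score(ground_truth, detections))
```

In this sketch, a detector whose localizations are loose keeps a high F-Score only at low thresholds, so its integrated score drops, whereas a fixed-threshold F-Score would hide that difference.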
Document type:
Journal article
Computer Vision and Image Understanding, Elsevier, 2014, 127, pp.14-30. <10.1016/j.cviu.2014.06.014>

https://hal.archives-ouvertes.fr/hal-01283866
Contributor: Julien Mille
Submitted on: Tuesday, 14 March 2017 - 12:17:47
Last modified on: Friday, 17 March 2017 - 01:08:33

File

Liris-6807.pdf
Files produced by the author(s)

Citation

Christian Wolf, Eric Lombardi, Julien Mille, Oya Celiktutan, Mingyuan Jiu, et al. Evaluation of video activity localizations integrating quality and quantity measurements. Computer Vision and Image Understanding, Elsevier, 2014, 127, pp.14-30. <10.1016/j.cviu.2014.06.014>. <hal-01283866>
