Preprints, Working Papers, ...

How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms?

Abstract: When sufficient labeled data are available, classical criteria based on Receiver Operating Characteristic (ROC) or Precision-Recall (PR) curves can be used to compare the performance of unsupervised anomaly detection algorithms. However, in many situations, few or no data are labeled. This calls for alternative criteria that can be computed on unlabeled data. In this paper, two criteria that do not require labels are empirically shown to discriminate accurately (w.r.t. ROC- or PR-based criteria) between algorithms. These criteria are based on existing Excess-Mass (EM) and Mass-Volume (MV) curves, which generally cannot be well estimated in high dimensions. A methodology based on feature sub-sampling and aggregating is also described and tested; it extends the use of these criteria to high-dimensional datasets and addresses major drawbacks inherent to standard EM and MV curves.
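The Mass-Volume criterion and the feature sub-sampling step described in the abstract can be sketched as follows. This is a minimal NumPy illustration under stated assumptions, not the paper's implementation: the function names (`mv_criterion`, `mv_subsampled`), the mass-level grid on [0.9, 0.999], and the bounding-box Monte Carlo volume estimate are illustrative choices.

```python
import numpy as np

def mv_criterion(score_fn, X, alphas=None, n_mc=100_000, rng=None):
    """Empirical Mass-Volume (MV) criterion: area under the MV curve.

    For each mass level alpha, pick the score threshold whose level set
    holds a fraction alpha of the data, then estimate the Lebesgue volume
    of that level set by Monte Carlo sampling on the data's bounding box.
    A sharper scoring function yields smaller level sets, so a LOWER
    area indicates a better anomaly ranking (illustrative convention).
    """
    rng = np.random.default_rng(rng)
    if alphas is None:
        # High-mass region, where normal behaviour concentrates.
        alphas = np.linspace(0.9, 0.999, 50)
    scores = score_fn(X)
    lo, hi = X.min(axis=0), X.max(axis=0)
    U = rng.uniform(lo, hi, size=(n_mc, X.shape[1]))  # uniform on the box
    u_scores = score_fn(U)
    box_vol = float(np.prod(hi - lo))
    mv = np.array([
        box_vol * np.mean(u_scores >= np.quantile(scores, 1.0 - a))
        for a in alphas
    ])
    # Trapezoidal area under the MV curve over the alpha grid.
    return float(np.sum(0.5 * (mv[1:] + mv[:-1]) * np.diff(alphas)))

def mv_subsampled(fit_score, X, d_sub=2, n_draws=20, rng=None):
    """Average the MV criterion over random feature subsets of size d_sub.

    `fit_score(X_sub)` is assumed to return a scoring function trained on
    the sub-sampled features; averaging over low-dimensional projections
    sidesteps the volume estimation that breaks in high dimensions.
    """
    rng = np.random.default_rng(rng)
    vals = []
    for _ in range(n_draws):
        feats = rng.choice(X.shape[1], size=d_sub, replace=False)
        X_sub = X[:, feats]
        vals.append(mv_criterion(fit_score(X_sub), X_sub, rng=rng))
    return float(np.mean(vals))
```

For instance, on 2-D Gaussian data a density-consistent score such as `lambda Z: -np.sum(Z**2, axis=1)` should obtain a smaller MV area than a random scorer, which is the label-free comparison the criterion enables.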

Contributor: Nicolas Goix
Submitted on: Monday, July 4, 2016 - 7:58:08 PM
Last modification on: Thursday, March 5, 2020 - 3:57:53 PM
Long-term archiving on: Wednesday, October 5, 2016 - 2:44:58 PM




  • HAL Id: hal-01341809, version 1
  • arXiv: 1607.01152


Nicolas Goix. How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms?. 2016. ⟨hal-01341809⟩


