Robust Anomaly Detection on Unreliable Data

Zilong Zhao; Sophie Cerf; Robert Birke; Bogdan Robu; Sara Bouchenak; Sonia Ben Mokhtar; Lydia Y. Chen

doi:10.1109/DSN.2019.00068

Communication Dans Un Congrès Année : 2019

Robust Anomaly Detection on Unreliable Data

(1) , (1) , (2) , (1) , (3) , (3) , (4)

1
2
3
4

Zilong Zhao

Fonction : Auteur
PersonId : 172877
IdHAL : zilong-zhao
IdRef : 255142862

GIPSA - Systèmes non linéaires et complexité

Sophie Cerf

Fonction : Auteur
PersonId : 169879
IdHAL : sophie-cerf
ORCID : 0000-0003-0122-0796

GIPSA - Systèmes non linéaires et complexité

Robert Birke

Fonction : Auteur
PersonId : 993251

ABB Corporate Research [Västerås]

Bogdan Robu

Fonction : Auteur
PersonId : 747277
IdHAL : bogdan-robu
ORCID : 0000-0001-7568-007X
IdRef : 156193779

GIPSA - Systèmes non linéaires et complexité

Sara Bouchenak

Fonction : Auteur
PersonId : 6304
IdHAL : sara-bouchenak
IdRef : 179480510

Distribution, Recherche d'Information et Mobilité

Sonia Ben Mokhtar

Fonction : Auteur
PersonId : 4352
IdHAL : sonia-ben-mokhtar
ORCID : 0000-0003-2821-7714
IdRef : 121974146

Distribution, Recherche d'Information et Mobilité

Lydia Y. Chen

Fonction : Auteur
PersonId : 1032373

Delft University of Technology

Résumé

Classification algorithms have been widely adopted to detect anomalies for various systems, e.g., IoT and cloud, under the common assumption that the data source is clean, i.e., features and labels are correctly set. However, data collected from the field can be unreliable due to careless annotations or malicious data transformation for incorrect anomaly detection. In this paper, we present a two-layer learning framework for robust anomaly detection (RAD) in the presence of unreliable anomaly labels. The first layer of quality model filters the suspicious data, where the second layer of classification model detects the anomaly types. We specifically focus on two use cases, (i) detecting 10classes of IoT attacks and (ii) predicting 4 classes of task failures of big data jobs. Our evaluation results show that RAD can robustly improve the accuracy of anomaly detection, to reach up to 98% for IoT device attacks (i.e., +11%) and up to 83% for cloud task failures (i.e., +20%), under a significant percentage of altered anomaly labels.

Mots clés

Anomaly Detection Failures Machine Learning Attacks Unreliable Data

Domaines

Intelligence artificielle [cs.AI] Systèmes et contrôle [cs.SY] Environnements Informatiques pour l'Apprentissage Humain

Fichier principal

dsn2019.pdf (512.59 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Zilong ZHAO : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02056558

Soumis le : vendredi 6 mars 2020-15:40:22

Dernière modification le : jeudi 4 avril 2024-21:08:28

Archivage à long terme le : dimanche 7 juin 2020-14:55:26

Dates et versions

hal-02056558 , version 1 (06-03-2020)

Identifiants

HAL Id : hal-02056558 , version 1
DOI : 10.1109/DSN.2019.00068

Citer

Zilong Zhao, Sophie Cerf, Robert Birke, Bogdan Robu, Sara Bouchenak, et al.. Robust Anomaly Detection on Unreliable Data. DSN 2019 - 49th IEEE/IFIP International Conference on Dependable Systems and Networks, Jun 2019, Portland, Oregon, United States. ⟨10.1109/DSN.2019.00068⟩. ⟨hal-02056558⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA TICE CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON GIPSA GIPSA-DA GIPSA-SYSCO LIRIS INSA-GROUPE UDL

945 Consultations

1486 Téléchargements

Robust Anomaly Detection on Unreliable Data

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager