Conference paper. Year: 2018

Adversarial vulnerability for any classifier

Abstract

Despite achieving impressive performance, state-of-the-art classifiers remain highly vulnerable to small, imperceptible, adversarial perturbations. This vulnerability has empirically proved difficult to address. In this paper, we study the phenomenon of adversarial perturbations under the assumption that the data is generated with a smooth generative model. We derive fundamental upper bounds on the robustness to perturbations of any classification function, and prove the existence of adversarial perturbations that transfer well across different classifiers with small risk. Our analysis of the robustness also provides insights into key properties of generative models, such as their smoothness and the dimensionality of their latent space. We conclude with numerical experimental results showing that our bounds provide informative baselines for the maximal achievable robustness on several datasets.
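To convey the flavor of such bounds (an illustrative sketch, not the paper's exact statement): assume the data is generated as x = g(z) with z drawn from a standard Gaussian in d dimensions and an L-Lipschitz generator g. For a classifier whose K prediction regions in latent space have Gaussian measures p_1, ..., p_K, write r_in(x) for the norm of the smallest on-manifold perturbation that changes the prediction and Φ for the standard Gaussian CDF (all of this notation is introduced here for illustration only). The Gaussian isoperimetric inequality then yields a bound of the form

\[
\Pr_{z \sim \mathcal{N}(0, I_d)}\!\left[\, r_{\mathrm{in}}(g(z)) > L\,\eta \,\right]
\;\le\;
\sum_{i=1}^{K} \Phi\!\left(\Phi^{-1}(p_i) - \eta\right)
\qquad \text{for every } \eta > 0 .
\]

The reasoning is that a latent perturbation of norm η that crosses into another prediction region maps, through the L-Lipschitz generator, to an image-space perturbation of norm at most Lη, so the robustness any classifier can achieve is controlled by the smoothness of the generative model rather than by the classifier itself; the precise constants, norms, and the role of the latent dimension are as given in the paper.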

Dates and versions

hal-01990465, version 1 (23-01-2019)

Identifiers

Cite

Alhussein Fawzi, Hamza Fawzi, Omar Fawzi. Adversarial vulnerability for any classifier. NeurIPS 2018, Dec 2018, Montreal, Canada. ⟨hal-01990465⟩
59 views
0 downloads
