Preprint, Working Paper. Year: 2019

Robust Neural Networks using Randomized Adversarial Training

Abstract

Since the discovery of adversarial examples in machine learning, researchers have designed several techniques to train neural networks that are robust against different types of attacks (most notably ℓ∞- and ℓ2-based attacks). However, it has been observed that defense mechanisms designed to protect against one type of attack often perform poorly against the other. In this paper, we introduce Randomized Adversarial Training (RAT), a technique that is effective against both ℓ2 and ℓ∞ attacks. To obtain this result, we build upon adversarial training, a technique that is effective against ℓ∞ attacks, and demonstrate that adding random noise at both training and inference time further improves performance against ℓ2 attacks. We then show that RAT is as effective as adversarial training against ℓ∞ attacks while remaining robust against strong ℓ2 attacks. Our final comparative experiments demonstrate that RAT outperforms all state-of-the-art approaches against both ℓ2 and ℓ∞ attacks.
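The core idea described above — adversarial training augmented with random noise injection — can be sketched on a toy model. The following is a minimal illustration, not the authors' implementation: it uses a one-step ℓ∞ attack (FGSM) on a logistic-regression classifier, and all names (`fgsm_linf`, `rat_train_step`, `noise_sigma`) are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Hedged sketch of the RAT idea on a toy logistic-regression model.
# The paper's actual method uses deep networks and stronger attacks;
# this only illustrates "adversarial example + random noise" training.

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fgsm_linf(w, b, x, y, eps):
    """One-step l-infinity attack (FGSM) against a linear classifier."""
    p = sigmoid(x @ w + b)
    grad_x = (p - y)[:, None] * w[None, :]   # d(BCE loss) / d x
    return x + eps * np.sign(grad_x)

def rat_train_step(w, b, x, y, eps=0.1, noise_sigma=0.05, lr=0.1):
    """One training step: craft an l-infinity adversarial example,
    then inject Gaussian noise on top of it before the update."""
    x_adv = fgsm_linf(w, b, x, y, eps)
    x_noisy = x_adv + rng.normal(0.0, noise_sigma, size=x_adv.shape)
    p = sigmoid(x_noisy @ w + b)
    grad_w = x_noisy.T @ (p - y) / len(y)
    grad_b = np.mean(p - y)
    return w - lr * grad_w, b - lr * grad_b

# Toy data: two well-separated Gaussian blobs.
x = np.vstack([rng.normal(-1, 0.3, (50, 2)), rng.normal(1, 0.3, (50, 2))])
y = np.concatenate([np.zeros(50), np.ones(50)])

w, b = np.zeros(2), 0.0
for _ in range(200):
    w, b = rat_train_step(w, b, x, y)

acc = np.mean((sigmoid(x @ w + b) > 0.5) == y)
print(f"clean accuracy: {acc:.2f}")
```

Per the abstract, noise is also added at inference time; in this sketch that would mean averaging predictions over several noisy copies of each input before thresholding.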
Main file: 1903.10219.pdf (352.91 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02380184 , version 1 (26-11-2019)
hal-02380184 , version 2 (06-02-2020)

Identifiers

  • HAL Id : hal-02380184 , version 1

Cite

Alexandre Araujo, Rafael Pinot, Benjamin Negrevergne, Laurent Meunier, Yann Chevaleyre, et al. Robust Neural Networks using Randomized Adversarial Training. 2019. ⟨hal-02380184v1⟩
194 Views
531 Downloads
