Adversarial frontier stitching for remote neural network watermarking

Erwan Le Merrer; Patrick Pérez; Gilles Trédan

doi:10.1007/s00521-019-04434-z

Journal article, Neural Computing and Applications, Year: 2019

Erwan Le Merrer

Role: Author
PersonId : 1051352

the World Is Distributed Exploring the tension between scale and coordination

Patrick Pérez

Role: Author
PersonId : 1022281

Technicolor [Cesson Sévigné]

Gilles Trédan

Role: Author
PersonId : 14277
IdHAL : gilles-tredan
IdRef : 119990385

Équipe Tolérance aux fautes et Sûreté de Fonctionnement informatique

Abstract

The state-of-the-art performance of deep learning models comes at a high cost for companies and institutions, due to tedious data collection and heavy processing requirements. Recently, Nagai et al. (Int J Multimed Inf Retr 7(1):3–16, 2018) and Uchida et al. (Embedding watermarks into deep neural networks, ICMR, 2017) proposed to watermark convolutional neural networks for image classification by embedding information into their weights. While this is clear progress toward model protection, the technique only allows the watermark to be extracted from a network that one can access locally and in its entirety. Instead, we aim to allow extraction of the watermark from a neural network (or any other machine learning model) that is operated remotely and made available through a service API. To this end, we propose to mark the model's action itself, slightly tweaking its decision frontiers so that a set of specific queries conveys the desired information. In the present paper, we formally introduce the problem and propose a novel zero-bit watermarking algorithm that makes use of adversarial model examples. While limiting the loss of performance of the protected model, this algorithm allows subsequent extraction of the watermark using only a few queries. We evaluated the approach on three neural networks designed for image classification, in the context of the MNIST digit recognition task.
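The remote extraction step described in the abstract can be illustrated with a minimal sketch. It assumes the watermark key is a small set of inputs with the labels the marked model is expected to return, and that the remote service is reachable through a function returning one label per query. The names `mismatch_threshold`, `verify_watermark` and `query`, as well as the decision rule (a binomial null model in which an unmarked model matches each key label with probability 1/2), are illustrative assumptions and are not taken from the abstract.

```python
from math import comb
from typing import Callable, Sequence


def mismatch_threshold(key_size: int, p_value: float = 0.05) -> int:
    """Return the largest mismatch count theta that still counts as 'marked'.

    Assumed null model (hypothetical): an unmarked model agrees with each
    key label independently with probability 1/2. theta is the largest
    value such that P(mismatches <= theta) < p_value under that model.
    Returns -1 when the key is too small to reach the requested p_value.
    """
    cumulative = 0.0
    theta = -1
    for z in range(key_size + 1):
        cumulative += comb(key_size, z) / 2 ** key_size
        if cumulative >= p_value:
            break
        theta = z
    return theta


def verify_watermark(
    query: Callable[[object], int],
    key_inputs: Sequence[object],
    key_labels: Sequence[int],
    p_value: float = 0.05,
) -> bool:
    """Zero-bit check: does the remote model answer the key as expected?

    `query` stands in for one call to the remote service API and is
    assumed to return a single predicted label for a single input.
    """
    mismatches = sum(
        1 for x, y in zip(key_inputs, key_labels) if query(x) != y
    )
    return mismatches <= mismatch_threshold(len(key_inputs), p_value)
```

Only `len(key_inputs)` queries are issued, which matches the abstract's point that extraction needs few queries. The tolerance on mismatches is a design choice in this sketch: it lets verification succeed even if a few key answers have drifted, while keeping the chance of falsely flagging an unmarked model below `p_value` under the assumed null model.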

Keywords

Watermarking; Neural network models; Black box interaction; Adversarial examples; Model decision frontiers

Domains

Neural networks [cs.NE]; Cryptography and security [cs.CR]

Main file

main-nca.pdf (385.06 KB)

Origin: Files produced by the author(s)

Contributor: Erwan Le Merrer

https://hal.science/hal-02264449

Submitted on: Wednesday, August 7, 2019, 09:26:15

Last modified on: Monday, November 20, 2023, 11:44:22

Long-term archiving on: Thursday, January 9, 2020, 01:41:19

Dates and versions

hal-02264449, version 1 (07-08-2019)

Identifiers

HAL Id: hal-02264449, version 1
DOI: 10.1007/s00521-019-04434-z

Cite

Erwan Le Merrer, Patrick Pérez, Gilles Trédan. Adversarial frontier stitching for remote neural network watermarking. Neural Computing and Applications, 2019, 32 (13), pp.9233-9244. ⟨10.1007/s00521-019-04434-z⟩. ⟨hal-02264449⟩

Collections

UNIV-TLSE2 UNIV-RENNES1 CNRS INRIA INSA-RENNES INSA-TOULOUSE LAAS IRISA LAAS-TSF UT1-CAPITOLE LAAS-INFORMATIQUE-CRITIQUE INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE LAAS-RISC UR1-MATH-NUM TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP
