Steganalysis into the Wild: How to Define a Source?

Abstract : It is now well known that practical steganalysis using machine learning techniques can be strongly biased by the problem of Cover Source Mismatch. Such a phenomenon usually occurs in machine learning when the training and the testing sets are drawn from different sources, i.e. when they do not share the same statistical properties. In the field of steganalysis however, due to the small power of the signal targeted by steganalysis methods, it can drastically lower their performance. This paper aims to define through practical experiments what is a source in steganalysis. By assuming that two cover datasets coming from a common source should provide comparable performances in steganalysis, it is shown that the definition of a source is more related with the processing pipeline of the RAW images than with the sensor or the acquisition setup of the pictures. In order to measure the discrepancy between sources, this paper introduces the concept of consistency between sources, that quantifies how much two sources are subject to Cover Source Mismatch. We show that by adopting "training de-sign", we can increase the consistency between the training set and the testing set. To measure how much image processing operation may help the steganographers this paper also introduces the intrinsic difficulty of a source. It is observed that some processes such as JPEG quan-tization tables or the development pipeline can dramatically increase or decrease the performance of steganalysis methods and that other parameters such as the ISO sensitivity or the sensor model have minor impact on the performance.
Type de document :
Communication dans un congrès
IS&T Electronic Imaging, Media Watermarking, Security, and Forensics 2018, Jan 2018, Burlingame, CA, United States
Liste complète des métadonnées

Littérature citée [21 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01685050
Contributeur : Rémi Cogranne <>
Soumis le : mardi 16 janvier 2018 - 08:54:17
Dernière modification le : mardi 27 février 2018 - 14:40:03
Document(s) archivé(s) le : dimanche 6 mai 2018 - 13:35:28

Fichier

steganalysis-wild-define_vFina...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01685050, version 1

Collections

UGA

Relations

Citation

Quentin Giboulot, Rémi Cogranne, Patrick Bas. Steganalysis into the Wild: How to Define a Source?. IS&T Electronic Imaging, Media Watermarking, Security, and Forensics 2018, Jan 2018, Burlingame, CA, United States. 〈hal-01685050〉

Partager

Métriques

Consultations de la notice

177

Téléchargements de fichiers

24