On the Shift Invariance of Max Pooling Feature Maps in Convolutional Neural Networks - Laboratoire Jean Kuntzmann
Conference paper, Year: 2023

On the Shift Invariance of Max Pooling Feature Maps in Convolutional Neural Networks

Abstract

Over the past decade, some progress has been made on understanding the strengths and limitations of convolutional neural networks (CNNs) for computer vision. In particular, the stability properties with respect to small transformations (translations, rotations, scaling, deformations) are only partially understood. In this talk, we study the combined effect of convolution and max pooling layers in generating quasi-invariant representations. This property is essential for classification, since it is expected that two translated versions of the same image are classified in the same way. When trained on datasets such as ImageNet, CNNs tend to learn parameters in the first layer that closely resemble oriented band-pass filters. By leveraging the properties of discrete Gabor-like convolutions, we establish conditions under which the feature maps computed by the subsequent max pooling operator approximate the modulus of complex Gabor-like coefficients, in which case they are stable with respect to small input shifts. We then compute a probabilistic measure of shift invariance for max pooling feature maps. More specifically, we show that some filters, depending on their frequency and orientation, are more likely than others to produce stable image representations. We experimentally validate our theory by considering a deterministic feature extractor based on the dual-tree complex wavelet packet transform, a particular case of discrete Gabor-like decomposition. We demonstrate a strong correlation between shift invariance on the one hand and similarity with complex modulus on the other hand.
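The link between max pooling and the complex modulus can be illustrated with a minimal one-dimensional sketch. This is not the authors' method: the talk concerns 2D images and a dual-tree complex wavelet packet transform, whereas the code below assumes a 1D signal, circular convolution, and a hand-built Gabor-like filter. All function names and parameter values (`gabor_filter`, `features`, `sigma=3.0`, `window=4`, and so on) are hypothetical choices made for illustration. The sketch compares three feature maps under a one-sample input shift: the max-pooled real part of the filter response, the plainly subsampled real part, and the max-pooled modulus.

```python
import cmath
import math


def gabor_filter(omega, sigma, half_width):
    """Complex 1D Gabor-like filter: Gaussian envelope times a complex exponential."""
    return [
        math.exp(-(n * n) / (2.0 * sigma * sigma)) * cmath.exp(1j * omega * n)
        for n in range(-half_width, half_width + 1)
    ]


def circular_conv(x, g, half_width):
    """Circular convolution z[n] = sum_m g[m] * x[n - m], m in [-half_width, half_width]."""
    size = len(x)
    return [
        sum(g[m + half_width] * x[(n - m) % size]
            for m in range(-half_width, half_width + 1))
        for n in range(size)
    ]


def shift(x, s):
    """Circularly shift x by s samples: y[n] = x[n - s]."""
    size = len(x)
    return [x[(n - s) % size] for n in range(size)]


def max_pool(r, window):
    """Non-overlapping max pooling (stride equal to the window size)."""
    return [max(r[k:k + window]) for k in range(0, len(r) - window + 1, window)]


def relative_change(a, b):
    """Euclidean distance between a and b, normalised by the norm of a."""
    num = math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)))
    den = math.sqrt(sum(u * u for u in a))
    return num / den


def features(x, omega=math.pi / 2, sigma=3.0, half_width=9, window=4):
    """Return three feature maps computed from the Gabor-like response of x."""
    z = circular_conv(x, gabor_filter(omega, sigma, half_width), half_width)
    real_part = [v.real for v in z]
    modulus = [abs(v) for v in z]
    return {
        "maxpool_real": max_pool(real_part, window),   # convolution + max pooling
        "subsampled_real": real_part[::window],        # convolution + plain subsampling
        "maxpool_modulus": max_pool(modulus, window),  # complex modulus reference
    }


# Idealised test signal (an assumption): a pure sinusoid at the filter's
# centre frequency, so that each pooling window spans exactly one period.
signal = [math.cos(math.pi / 2 * n + 0.3) for n in range(64)]
before = features(signal)
after = features(shift(signal, 1))

for name in ("maxpool_real", "subsampled_real", "maxpool_modulus"):
    print(name, relative_change(before[name], after[name]))
```

Because the real part of a complex number never exceeds its modulus, the max-pooled real part is bounded above by the max-pooled modulus. In this idealised setting the pooling window covers a full oscillation of the filter response, so the max-pooled output barely moves under the shift while the subsampled real part changes substantially. Natural images are not pure sinusoids, which is why the abstract frames shift invariance as a probabilistic measure that depends on filter frequency and orientation.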
Workshop-ASCETE-2023.pdf (29.23 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-04286275, version 1 (15-11-2023)

License

Attribution

Identifiers

  • HAL Id: hal-04286275, version 1

Cite

Hubert Leterme, Kévin Polisano, Karteek Alahari, Valérie Perrier. On the Shift Invariance of Max Pooling Feature Maps in Convolutional Neural Networks. Workshop ASCETE 2023 - Workshop pour l'association pour les orthoptéristes et les entomocénoticiens, Sylvain Meignen, Nov 2023, Grenoble, France. ⟨hal-04286275⟩