Exploring Weight Symmetry in Deep Neural Networks

Xu Shell Hu; Sergey Zagoruyko; Nikos Komodakis

doi:10.1016/j.cviu.2019.07.006

Article Dans Une Revue Computer Vision and Image Understanding Année : 2019

Exploring Weight Symmetry in Deep Neural Networks

(1, 2, 3) , (4) , (3, 5, 1, 2)

1
2
3
4
5

Xu Shell Hu

Fonction : Auteur

Laboratoire d'Informatique Gaspard-Monge

IMAGINE [Marne-la-Vallée]

École des Ponts ParisTech

Sergey Zagoruyko

Fonction : Auteur
PersonId : 1041507

Models of visual object recognition and scene understanding

Nikos Komodakis

Fonction : Auteur

École des Ponts ParisTech

Computer Science Department [Crete]

Laboratoire d'Informatique Gaspard-Monge

IMAGINE [Marne-la-Vallée]

Résumé

We propose to impose symmetry in neural network parameters to improve parameter usage and make use of dedicated convolution and matrix multiplication routines. Due to significant reduction in the number of parameters as a result of the symmetry constraints, one would expect a dramatic drop in accuracy. Surprisingly, we show that this is not the case, and, depending on network size, symmetry can have little or no negative effect on network accuracy, especially in deep overparameterized networks. We propose several ways to impose local symmetry in recurrent and convolutional neural networks, and show that our symmetry parameterizations satisfy universal approximation property for single hidden layer networks. We extensively evaluate these parameterizations on CIFAR, ImageNet and language modeling datasets, showing significant benefits from the use of symmetry. For instance, our ResNet-101 with channel-wise symmetry has almost 25% less parameters and only 0.2% accuracy loss on ImageNet. Code for our experiments is available at https://github.com/hushell/deep-symmetry

Domaines

Intelligence artificielle [cs.AI] Vision par ordinateur et reconnaissance de formes [cs.CV] Apprentissage [cs.LG]

Fichier principal

S107731421930102X.pdf (493.03 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Accord Elsevier CCSD : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01978633

Soumis le : mercredi 20 juillet 2022-15:05:37

Dernière modification le : vendredi 26 avril 2024-13:43:43

Archivage à long terme le : vendredi 21 octobre 2022-20:37:48

Dates et versions

hal-01978633 , version 1 (20-07-2022)

Licence

Paternité - Pas d'utilisation commerciale

Identifiants

HAL Id : hal-01978633 , version 1
ARXIV : 1812.11027
DOI : 10.1016/j.cviu.2019.07.006
PII : S1077-3142(19)30102-X

Citer

Xu Shell Hu, Sergey Zagoruyko, Nikos Komodakis. Exploring Weight Symmetry in Deep Neural Networks. Computer Vision and Image Understanding, 2019, 187, ⟨10.1016/j.cviu.2019.07.006⟩. ⟨hal-01978633⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS ENPC CNRS INRIA LIGM_A3SI INSMI PARISTECH LIGM IMAGINE INRIA2 PSL UNIV-EIFFEL JSE2024

202 Consultations

87 Téléchargements

Exploring Weight Symmetry in Deep Neural Networks

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager