Conference paper. Year: 2021

Attention Based Pruning for Shift Networks

Abstract

In many application domains, such as computer vision, Convolutional Layers (CLs) are key to the accuracy of deep learning methods. However, reaching state-of-the-art accuracy often requires assembling a large number of CLs, each containing thousands of parameters, resulting in complex and demanding systems that are poorly suited to resource-limited devices. Recently, methods have been proposed to replace the generic convolution operator with the combination of a shift operation and a simpler 1x1 convolution. The resulting block, called a Shift Layer (SL), is an efficient alternative to CLs in the sense that it reaches similar accuracy on various tasks with faster computations and fewer parameters. In this contribution, we introduce Shift Attention Layers (SALs), which extend SLs with an attention mechanism that learns which shifts are best while the network itself is being trained. We demonstrate that SALs outperform vanilla SLs (and CLs) on various object recognition benchmarks while significantly reducing the number of floating-point operations and parameters at inference.
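As a concrete illustration of the two blocks described in the abstract, below is a minimal PyTorch sketch of a Shift Layer and a Shift Attention Layer. It is a reconstruction from the abstract, not the authors' implementation: the class names and the choice of a 3x3 shift neighborhood are assumptions, and torch.roll (which wraps around at image borders) stands in for a zero-padded shift.

import torch
import torch.nn as nn

# Candidate shifts: the nine displacements of a 3x3 neighborhood.
SHIFTS = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]

class ShiftLayer(nn.Module):
    """Shift Layer sketch: a fixed, parameter-free per-channel spatial
    shift followed by a learned 1x1 convolution that mixes channels."""

    def __init__(self, in_channels, out_channels):
        super().__init__()
        # Cycle the input channels over the nine candidate shifts.
        self.register_buffer("shift_idx", torch.arange(in_channels) % len(SHIFTS))
        self.pointwise = nn.Conv2d(in_channels, out_channels, kernel_size=1)

    def forward(self, x):
        out = torch.empty_like(x)
        for i, (dy, dx) in enumerate(SHIFTS):
            mask = self.shift_idx == i
            # torch.roll wraps around at borders; a zero-padded shift is
            # closer to a shifted convolution but needs more code.
            out[:, mask] = torch.roll(x[:, mask], shifts=(dy, dx), dims=(2, 3))
        return self.pointwise(out)

class ShiftAttentionLayer(nn.Module):
    """Shift Attention Layer sketch: every channel sees all nine candidate
    shifts, weighted by trainable attention scores that are learned jointly
    with the rest of the network."""

    def __init__(self, in_channels, out_channels):
        super().__init__()
        # One trainable score per (input channel, candidate shift) pair.
        self.scores = nn.Parameter(torch.zeros(in_channels, len(SHIFTS)))
        self.pointwise = nn.Conv2d(in_channels, out_channels, kernel_size=1)

    def forward(self, x):
        weights = torch.softmax(self.scores, dim=1)  # shape (C, 9)
        out = torch.zeros_like(x)
        for i, (dy, dx) in enumerate(SHIFTS):
            shifted = torch.roll(x, shifts=(dy, dx), dims=(2, 3))
            out = out + weights[:, i].view(1, -1, 1, 1) * shifted
        return self.pointwise(out)

After training, a plausible pruning step is to keep, for each channel, only the shift with the highest attention score and drop the scores altogether, so the deployed layer has exactly the cost of a plain SL.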
Main file: Attention Based Pruning for Shift Networks.pdf (506.58 KB). Origin: files produced by the author(s).

Dates and versions

hal-03238771, version 1 (30-07-2021)

Identifiers

HAL Id: hal-03238771
DOI: 10.1109/ICPR48806.2021.9412859

Cite

Ghouthi Boukli Hacene, Carlos Lassance, Vincent Gripon, Matthieu Courbariaux, Yoshua Bengio. Attention Based Pruning for Shift Networks. ICPR 2020: 25th International Conference on Pattern Recognition, Jan 2021, Milan (virtual), Italy. pp.4054-4061, ⟨10.1109/ICPR48806.2021.9412859⟩. ⟨hal-03238771⟩