ShaResNet: reducing residual network parameter number by sharingweights

Alexandre Boulch

doi:10.1016/j.patrec.2018.01.006

Article Dans Une Revue Pattern Recognition Letters Année : 2018

ShaResNet: reducing residual network parameter number by sharingweights

(1)

Alexandre Boulch

Fonction : Auteur
PersonId : 4315
IdHAL : boulch-alexandre
ORCID : 0000-0002-4196-9665
IdRef : 184109434

ONERA - The French Aerospace Lab [Palaiseau]

Résumé

Deep Residual Networks have reached the state of the art in many image processing tasks such image classification. However, the cost for a gain in accuracy in terms of depth and memory is prohibitive as it requires a higher number of residual blocks, up to double the initial value. To tackle this problem, we propose in this paper a way to reduce the redundant information of the networks. We share the weights of convolutional layers between residual blocks operating at the same spatial scale. The signal flows multiple times in the same convolutional layer. The resulting architecture, called ShaResNet, contains block specific layers and shared layers. These ShaResNet are trained exactly in the same fashion as the commonly used residual networks. We show, on the one hand, that they are almost as efficient as their sequential counterparts while involving less parameters, and on the other hand that they are more efficient than a residual network with the same number of parameters. For example, a 152-layer-deep residual network can be reduced to 106 convolutional layers, i.e. a parameter gain of 39%, while loosing less than 0.2% accuracy on ImageNet.

Mots clés

NEURAL NETWORK

APPRENTISSAGE AUTOMATIQUE RESEAU NEURONAL

Domaines

Base de données [cs.DB]

Fichier principal

DTIS18023.1518082767_postprint.pdf (829.85 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Cécile André : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01708867

Soumis le : vendredi 16 février 2018-09:51:58

Dernière modification le : vendredi 14 avril 2023-16:46:04

Archivage à long terme le : mardi 8 mai 2018-02:35:08

Dates et versions

hal-01708867 , version 1 (16-02-2018)

Identifiants

HAL Id : hal-01708867 , version 1
DOI : 10.1016/j.patrec.2018.01.006

Citer

Alexandre Boulch. ShaResNet: reducing residual network parameter number by sharingweights. Pattern Recognition Letters, 2018, page 53 - 59. ⟨10.1016/j.patrec.2018.01.006⟩. ⟨hal-01708867⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ONERA UNIV-PARIS-SACLAY

96 Consultations

277 Téléchargements

ShaResNet: reducing residual network parameter number by sharingweights

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager