Learning the Structure of Deep Architectures Using L1 Regularization

Praveen Kulkarni; Joaquin Zepeda; Frédéric Jurie; Patrick Pérez; Louis Chevallier

doi:10.5244/C.29.23

Communication Dans Un Congrès Année : 2015

Learning the Structure of Deep Architectures Using L1 Regularization

(1) , (1) , (2) , (1) , (1)

1
2

Praveen Kulkarni

Fonction : Auteur
PersonId : 975955

Technicolor R & I [Cesson Sévigné]

Joaquin Zepeda

Fonction : Auteur
PersonId : 883135

Technicolor R & I [Cesson Sévigné]

Frédéric Jurie

Fonction : Auteur
PersonId : 3233
IdHAL : frederic-jurie
ORCID : 0000-0002-2686-0020
IdRef : 080485022

Equipe Image - Laboratoire GREYC - UMR6072

Patrick Pérez

Fonction : Auteur
PersonId : 1022281

Technicolor R & I [Cesson Sévigné]

Louis Chevallier

Fonction : Auteur
PersonId : 940855

Technicolor R & I [Cesson Sévigné]

Résumé

We present a method that formulates the selection of the structure of a deep architecture as a penalized, discriminative learning problem. Up to now, the structure of deep architectures has been fixed by hand, and only the weights are learned using discriminative learning. Our work is a first attempt towards a more formal method of deep structure selection. We consider architectures consisting only of fully-connected layers, and our approach relies on diagonal matrices inserted between subsequent layers. By including an L1 norm of the diagonal entries of said matrices as a regularization penalty, we force the diagonals to be sparse, accordingly selecting the effective number of rows (respectively, columns) of the corresponding layer's (next layer's) weights matrix. We carry out experiments on a standard dataset and show that our method succeeds in selecting the structure of deep architectures of multiple layers. One variant of our architecture results in a feature vector of size as little as $36$, while retaining very high image classification performance.

Domaines

Traitement des images [eess.IV]

Fichier principal

hal-01266462.pdf (261.65 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Greyc Référent : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01266462

Soumis le : vendredi 13 octobre 2017-12:45:43

Dernière modification le : mercredi 20 mars 2024-16:20:04

Archivage à long terme le : dimanche 14 janvier 2018-13:21:13

Dates et versions

hal-01266462 , version 1 (13-10-2017)

Identifiants

HAL Id : hal-01266462 , version 1
DOI : 10.5244/C.29.23

Citer

Praveen Kulkarni, Joaquin Zepeda, Frédéric Jurie, Patrick Pérez, Louis Chevallier. Learning the Structure of Deep Architectures Using L1 Regularization. British Machine Vision Conference, 2015, Sep 2015, swansea, United Kingdom. ⟨10.5244/C.29.23⟩. ⟨hal-01266462⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS GREYC GREYC-IMAGE COMUE-NORMANDIE ENSICAEN UNICAEN

272 Consultations

283 Téléchargements

Learning the Structure of Deep Architectures Using L1 Regularization

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager