Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Learning Sparse deep neural networks using efficient structured projections on convex constraints for green AI

Abstract : Deep neural networks (DNN) have been applied recently to different domains and perform better than classical state-of-the-art methods. However the high level of performances of DNNs is most often obtained with networks containing millions of parameters and for which training requires substantial computational power. To deal with this computational issue proximal regularization methods have been proposed in the literature but they are time consuming.\\ In this paper, we propose instead a constrained approach. We provide the general framework for this new projection gradient method. Our algorithm iterates a gradient step and a projection on convex constraints. We studied algorithms for different constraints: the classical $\ell_1$ unstructured constraint and structured constraints such as the $\ell_{2,1} $ constraint (Group LASSO). We propose a new $\ell_{1,1} $ structured constraint for which we provide a new projection algorithm. Finally, we used the recent "Lottery optimizer" replacing the threshold by our $\ell_{1,1} $ projection. We demonstrate the effectiveness of this method with three popular datasets (MNIST, Fashion MNIST and CIFAR). Experiments with these datasets show that our projection method using this new $\ell_{1,1} $ structured constraint provides the best decrease in memory and computational power.
Document type :
Preprints, Working Papers, ...
Complete list of metadatas

Cited literature [60 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02556382
Contributor : Michel Barlaud <>
Submitted on : Wednesday, October 28, 2020 - 8:16:36 AM
Last modification on : Thursday, October 29, 2020 - 10:56:04 AM

File

DNN-ICPR-Milan.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02556382, version 3

Citation

Michel Barlaud, Frederic Guyard. Learning Sparse deep neural networks using efficient structured projections on convex constraints for green AI. 2020. ⟨hal-02556382v3⟩

Share