Unsupervised Layer-Wise Model Selection in Deep Neural Networks

Ludovic Arnold; Hélène Paugam-Moisy; Michèle Sebag

doi:10.3233/978-1-60750-606-5-915

Conference paper, Year: 2010

Ludovic Arnold

Role: Author
PersonId : 865134

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Hélène Paugam-Moisy

Role: Author
PersonId : 865135

Data Mining and Machine Learning

Machine Learning and Optimisation

Michèle Sebag

Role: Author
PersonId : 836537

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Abstract

Deep Neural Networks (DNNs) offer a new and efficient machine learning architecture built layer by layer from several representation layers. A critical issue for DNNs remains model selection, e.g., selecting the number of neurons in each layer. The hyper-parameter search space grows exponentially with the number of layers, making the popular grid-search approach to finding good hyper-parameter values intractable. This paper investigates whether the unsupervised, layer-wise methodology used to train a DNN can be extended to model selection as well. Using an unsupervised criterion, the proposed approach empirically examines whether model selection is a modular optimization problem that can be tackled in a layer-wise manner. Preliminary results on the MNIST data set suggest that the answer is positive. Further, some unexpected results on how the optimal layer size depends on the training process are reported and discussed.
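To make the cost argument in the abstract concrete, the following minimal Python sketch compares the number of configurations evaluated by a joint grid search with the number evaluated by a greedy layer-wise search. It is an illustration under stated assumptions, not the authors' implementation: the function names, the candidate sizes, and `toy_criterion` are hypothetical, and the actual unsupervised criterion used in the paper is not specified on this page.

```python
from typing import Callable, Sequence, Tuple


def grid_search_size(candidates: Sequence[int], n_layers: int) -> int:
    """Configurations a joint grid search must evaluate: k ** L."""
    return len(candidates) ** n_layers


def layerwise_search_size(candidates: Sequence[int], n_layers: int) -> int:
    """Configurations a greedy layer-wise search evaluates: k * L."""
    return len(candidates) * n_layers


def layerwise_select(
    candidates: Sequence[int],
    n_layers: int,
    criterion: Callable[[Tuple[int, ...], int], float],
) -> Tuple[int, ...]:
    """Pick each layer's size in turn, keeping the lower layers fixed.

    `criterion(lower_sizes, size)` is a hypothetical unsupervised score
    (lower is better) for adding a layer of `size` units on top of the
    already selected `lower_sizes`.
    """
    chosen: list[int] = []
    for _ in range(n_layers):
        best = min(candidates, key=lambda size: criterion(tuple(chosen), size))
        chosen.append(best)
    return tuple(chosen)


def toy_criterion(lower_sizes: Tuple[int, ...], size: int) -> float:
    """Stand-in score for illustration only: favours sizes near 300 units."""
    return abs(size - 300) + 0.01 * len(lower_sizes)


if __name__ == "__main__":
    sizes = [100, 200, 300, 400, 500]
    print(grid_search_size(sizes, 3))       # joint grid search cost
    print(layerwise_search_size(sizes, 3))  # layer-wise search cost
    print(layerwise_select(sizes, 3, toy_criterion))
```

The layer-wise search is only valid if model selection really is modular, i.e. if the best size for a layer does not depend on the layers trained above it; this is the hypothesis the paper examines empirically.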

Keywords

Learning; Deep Neural Networks; Model Selection

Domains

Neural networks [cs.NE]

Main file

ECAI-632.pdf (537.99 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-00488338

Submitted on: Tuesday, June 1, 2010, 17:34:08

Last modified on: Monday, February 12, 2024, 09:48:04

Long-term archiving on: Friday, October 19, 2012, 15:30:30

Dates and versions

hal-00488338, version 1 (01-06-2010)

Identifiers

HAL Id: hal-00488338, version 1
DOI: 10.3233/978-1-60750-606-5-915

Cite

Ludovic Arnold, Hélène Paugam-Moisy, Michèle Sebag. Unsupervised Layer-Wise Model Selection in Deep Neural Networks. 19th European Conference on Artificial Intelligence (ECAI'10), Aug 2010, Lisbon, Portugal. ⟨10.3233/978-1-60750-606-5-915⟩. ⟨hal-00488338⟩

Collections

EC-PARIS CNRS INRIA UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY INSA-GROUPE UDL ANR

768 Views

273 Downloads
