Hyperparameter optimization with approximate gradient

Fabian Pedregosa

Conference paper, Year: 2016

Fabian Pedregosa

Role: Author

CEntre de REcherches en MAthématiques de la DEcision

Abstract

Most models in machine learning contain at least one hyperparameter to control model complexity. Choosing an appropriate set of hyperparameters is both crucial for model accuracy and computationally challenging. In this work, we propose an algorithm for the optimization of continuous hyperparameters using inexact gradient information. An advantage of this method is that hyperparameters can be updated before model parameters have fully converged. We also give sufficient conditions for the global convergence of this method, based on regularity conditions of the involved functions and summability of errors. Finally, we validate the empirical performance of this method on the estimation of regularization constants of L2-regularized logistic regression and kernel ridge regression. Empirical benchmarks indicate that our approach is highly competitive with state-of-the-art methods.
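The idea in the abstract can be illustrated with a minimal sketch. This is not the paper's published algorithm, only one possible instance under stated assumptions: the model is plain ridge regression with penalty exp(log_reg), the hypergradient comes from implicit differentiation of the inner optimality condition, both the inner problem and the linear system are solved by a hand-written conjugate gradient stopped at tolerance `tol0 * decay**k` (a summable error sequence), and the outer step size is fixed. All function and parameter names are hypothetical.

```python
import numpy as np


def conjugate_gradient(A, b, x0, tol, max_iter=1000):
    """Approximately solve A x = b (A symmetric positive definite).

    Stops once the recursively updated residual norm is <= tol.
    """
    x = x0.copy()
    r = b - A @ x
    p = r.copy()
    rs = r @ r
    for _ in range(max_iter):
        if np.sqrt(rs) <= tol:
            break
        Ap = A @ p
        step = rs / (p @ Ap)
        x += step * p
        r -= step * Ap
        rs_new = r @ r
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x


def approx_hypergradient(X_tr, y_tr, X_val, y_val, log_reg, w0, q0, tol):
    """Validation loss and its approximate derivative w.r.t. log_reg.

    Inner problem: min_w 0.5*||X_tr w - y_tr||^2 + 0.5*exp(log_reg)*||w||^2
    Outer loss:    0.5 * mean((X_val w - y_val)^2)
    """
    reg = np.exp(log_reg)
    H = X_tr.T @ X_tr + reg * np.eye(X_tr.shape[1])  # inner Hessian
    # Inexact inner solve, warm-started from the previous iterate w0.
    w = conjugate_gradient(H, X_tr.T @ y_tr, w0, tol)
    residual = X_val @ w - y_val
    val_loss = 0.5 * np.mean(residual ** 2)
    grad_val = X_val.T @ residual / len(y_val)
    # Inexact solve of H q = grad_val (implicit differentiation step).
    q = conjugate_gradient(H, grad_val, q0, tol)
    # Cross derivative of the inner gradient w.r.t. log_reg is reg * w.
    hypergrad = -reg * (w @ q)
    return val_loss, hypergrad, w, q


def optimize_log_reg(X_tr, y_tr, X_val, y_val, log_reg=0.0,
                     n_outer=30, step_size=0.5, tol0=0.1, decay=0.9):
    """Gradient descent on log_reg with geometrically shrinking tolerances."""
    w = np.zeros(X_tr.shape[1])
    q = np.zeros(X_tr.shape[1])
    losses = []
    for k in range(n_outer):
        tol = tol0 * decay ** k  # assumed summable error schedule
        loss, grad, w, q = approx_hypergradient(
            X_tr, y_tr, X_val, y_val, log_reg, w, q, tol)
        losses.append(loss)
        # Update the hyperparameter before w has fully converged.
        log_reg -= step_size * grad
    return log_reg, w, losses
```

Optimizing `log_reg` rather than the penalty itself keeps the regularization constant positive without a constraint. The fixed `step_size` is a simplification chosen for brevity; it is not tuned and carries no convergence guarantee in this sketch.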

Domains

Machine Learning [cs.LG]

https://inria.hal.science/hal-01386410

Submitted on: Monday, 24 October 2016, 10:22:33

Last modified on: Friday, 26 April 2024, 13:44:38

Dates and versions

hal-01386410, version 1 (24-10-2016)

Identifiers

HAL Id: hal-01386410, version 1
arXiv: 1602.02355

Cite

Fabian Pedregosa. Hyperparameter optimization with approximate gradient. Proceedings of the 33rd International Conference on Machine Learning, Jun 2016, New York, United States. ⟨hal-01386410⟩

Collections

CNRS UNIV-DAUPHINE INSMI CEREMADE PSL
